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Introduction 


This book is the second volume of a full and detailed course in the elements 
of real and complex analysis that mathematical undergraduates may expect 
to meet. Indeed, it was initially based on those parts of analysis that under- 
graduates at Cambridge University meet, or used to meet, in their first two 
years. There is however always a temptation to go a bit further, and this 
is a temptation that I have not resisted. Thus I have included accounts of 
Baire’s category theorem, and the Arzela—Ascoli theorem, which are taught 
in the third year, and the Mazur—Ulam theorem, which, as far as I know, 
has never been taught. As a consequence, there are certain sections that can 
be omitted on a first reading. These are indicated by asterisks. 

Volume I was concerned with analysis on the real line. In Part Three, 
the analysis is extended to a more general setting. We introduce and con- 
sider metric and topological spaces, and normed spaces. In fact, metric and 
metrizable spaces are sufficient for all subsequent needs, but many of the 
properties that we investigate are topological properties, and it is well worth 
understanding what this means. The study of topological spaces can degen- 
erate into the construction of pathological examples; once again, temptation 
is not resisted, and Section 11.6 contains a collection of these. This section 
can be omitted at a first reading (and indeed at any subsequent reading). 
Baire’s category theorem is proved in Section 12.6; it is remarkable that a 
theorem with a rather easy proof can lead to so many strong conclusions, 
but this is another section that can be omitted at a first reading. The notion 
of compactness, which is a fundamental topological idea, is studied in some 
detail. Tychonoff’s theorem on the compactness of the product of compact 
spaces, which involves the axiom of choice, is too hard to include here: a 
proof is given in Appendix D. 
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In Part Four, we come back down to earth. The principal concern is the 
differentiation and integration of functions of several variables. Differentia- 
tion is interesting and reasonably straightforward, and we consider functions 
defined on a normed space; this shows that the results do not depend on 
any particular choice of coordinate system. Integration is another matter. 
To begin with it seems that the ideas of Riemann integration developed 
in Part Two carry over easily to higher dimensions, but serious prob- 
lems arise as soon as a non-linear change of variables is considered. It is 
however possible to establish results that suffice in a great number of con- 
texts. For example, the change of variables results are used in Volume III, 
where we introduce the Lebesgue measure, and the corresponding theory of 
integration. These results on differentiation and integration are applied in 
Chapter 19, where we consider subspaces of a Euclidean space which are 
differential manifolds — subspaces which locally look like Euclidean space. 

This volume requires the knowledge of some elementary results in linear 
algebra; these are described and established in Appendix B. 

The text includes plenty of exercises. Some are straightforward, some are 
searching, and some contain results needed later. All help to develop an 
understanding of the theory: do them! 

I am extremely grateful to Zhuo Min ‘Harold’ Lim who read the proofs, 
and found embarrassingly many errors. Any remaining errors are mine alone. 
Corrections and further comments can be found on a web page on my 
personal home page at www.dpmms.cam.ac.uk. 
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Metric and topological spaces 
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Metric spaces and normed spaces 


11.1 Metric spaces: examples 


In Volume I, we established properties of real analysis, starting from the 
properties of the ordered field R. of real numbers. Although the fundamental 
properties of R depend upon the order structure of R, most of the ideas and 
results of the real analysis that we considered (such as the limit of a sequence, 
or the continuity of a function) can be expressed in terms of the distance 
d(x,y) = |x — y| defined in Section 3.1. The concept of distance occurs in 
many other areas of analysis, and this is what we now investigate. 

A metric space is a pair (X,d), where X is a set and d is a function 
from the product X x X to the set R* of non-negative real numbers, which 
satisfies 


1. d(x, y) = d(y, x) for all x,y € X (symmetry); 
2. d(x,z) < d(x, y) + d(y, z) for all x,y,z € X (the triangle inequality); 
3. d(x, y) = 0 if and only if x = y. 


d is called a metric, and d(x, y) is the distance from x to y. The conditions 
are very natural: the distance from x to y is the same as the distance from 
y to x; the distance from x to y via z is at least as far as any more direct 
route, and any two distinct points of X are a positive distance apart. 

Let us give a few examples, to get us started. 


Example 11.1.1 R, with the metric d(x, y) = |x — y|, is a metric space, 
as is C, with the metric d(z, w) = |z — w|. 


This was established in Volume I, in Propositions 3.1.2 and 3.7.2. These 
metrics are called the usual metrics. If we consider R or C as a metric space, 
without specifying the metric, we assume that we are considering the usual 
metric. 
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Example 11.1.2 The Euclidean metric. 


We can extend the ideas of the previous example to higher dimensions. 
We need an inequality. 


Proposition 11.1.3 (Cauchy’s inequality) Jf x,y € R?@ then 
L L 
d d 2 d 2 
2 5 
Let S|} | Ue 
j=l j=l j=l 
Equality holds if and only if xiyj = 2jy; for 1 <1,9 < d. 


Proof We give the proof given by Cauchy in 1821, using Lagrange’s 
identity: 


2 

d d d 

dius | + DL (wings — yu)? = | doa] (doy; |. 
j=l {(i,j) <i} j=l j=l 

which follows by expanding the terms. This clearly establishes the inequality, 

and also shows that equality holds if and only if x;y; =a,;y; for 1<i,j <d. 


Corollary 11.1.4 If 2z,y € R%, let d(x,y) = CoCr — y;)°)¥/?. Then d 
is a metric on R¢. 


Proof Conditions (i) and (iii) are clearly satisfied. We must establish the 
triangle inequality. First we use Cauchy’s inequality to show that if a,b € R?, 
then d(a + 6,0) < d(a,0) + d(b, 0): 


d 
d(a + 6,0)? = S"(aj +;)? 
j=l 
d d d 
= DG +2 > aybj + D7; 
o=% j=l g=1 


< d(a,0)* + 2d(a,0).d(b, 0) + d(b, 0)? = (d(a,0) + d(b,0))?. 


Note that it follows from the definitions that d is translation invariant: 
d(a,b) = d(a+c,b+c). In particular, d(a,b) = d(a — b,0). If x,y,z € R%, 
seta=x—yandb=y-—z,sothata+b=x-—z. Then 


d(x, z) =d(a+b,0) < d(a,0) + d(b,0) = d(x, y) + d(y, z). 
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The metric d is called the Euclidean metric, or standard metric, on R¢. 
When d = 2 or 3 it is the usual measure of distance. 
We can also consider complex sequences. 


Corollary 11.1.5 If z,w € C%, let d(z,w) = OS |z; — w,|?)/?. Then 
d is a metric on C2. 
Proof Again, conditions (i) and (iii) are clearly satisfied, and we must 


establish the triangle inequality. First we show that if z,w € C®% then 
d(z + w,0) < d(z,0) + d(w, 0): using the inequality of the previous corollary, 


, 1/2 P 1/2 
d(z+w,0)= So zg + wy]? < So (leg + fwyl)? 
j=l j=l 
; 1/2 r 1/2 
<| Sole? } + ( Solwi?) = dlz,0) + d(w,0). 
j=l j=l 


Again d is translation invariant, so that d(r,s) = d(r — s,0). If r,s,t € C4 
let z=r—sandw=s-—t,so that z+w=r-—tand 


d(r,t) = d(z + w,0) < d(z,0) + d(w,0) = d(r, s) + d(s, t). 


The metric d is called the standard metric on C?. 
We shall study these metrics in more detail, later. 


Example 11.1.6 The discrete metric. 


Let X be any set. We define d(z,y) =lifa Ay and d(z,y)=O0ifx=y. 
Then d is a metric on X, the discrete metric. If « € X, there are no other 
points of X close to x; this means, as we shall see, that analysis on X is 
rather trivial. 


Example 11.1.7 The subspace metric. 


If (X,d) is a metric space, and Y is a subset of X, then the restriction of 
dto Y x Y isa metric on Y. This metric is the subspace metric on Y, and 
Y, with the subspace metric, is called a metric subspace of (X,d). 

The subspace metric is a special case of the following. Suppose that (X, d) 
is a metric space and that f is an injective mapping of a set Y into X. If we 
set p(y, y’) = d(f(y), f(y’)) then it is immediately obvious that p is a metric 
on Y. For example, we can give (—7/2,7/2) the usual metric, as a subset 
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of R. The mapping j = tan~! is a bijection of R. onto (—1/2, 7/2). Thus if 
we set 


oly, y’) = |5(y) — (y’)| = |tan*(y) — tan“! (y’)], 


then p is a metric on R. 
Example 11.1.8 <A metric on the extended real line R. 


We can extend the mapping j of the previous example to R by set- 
ting j(—oo) = —a/2 and j(+oo) = —2a/2. Then j is a bijection of 
R onto [—7/2,7/2], and we can again define a metric on R by setting 
p(y, y') = |(y) — i(y’)|. Thus 


ply, co) = 7/2 — tan~*(y), 
p(—0o, y) = tan" (y) + 7/2 


and p(—oo,00) = 7. 
Example 11.1.9 A metric on N= NU {oo}. 


Here is a similar construction. If n € N, let f(n) = 1/n, and let 
f(4+00) =0. f is an injective map of N onto a closed and bounded subset 
of R. Define p(x, x’) = |f (x) — f(2’)|. This defines a metric on N: 


p(m,n) = |1/m —1/n| and p(m, co) = 1/m. 
Example 11.1.10 The uniform metric. 


There are many cases where we define a metric on a space of functions. 
Here is the first and most important example. First we need some definitions. 
Suppose that B is a non-empty subset of a metric space (X,d). The diameter 
diam (B) of B is defined to be sup{d(b, b') : b,b’ € B}. The set B is bounded 
if diam(B) < oo. If f : S — (X,d) is a mapping and A is a non-empty 
subset of S, we define the oscillation Q(f, A) of f on A to be the diameter of 
f(A): Q(f, A) = sup{d(f (a), f(a’)) : a,a’ € A}. The function f is bounded if 
O(f, S) = diam (f(S)) < oo. 


Proposition 11.1.11 Let B(S,X) = Bx(S) denote the set of all 
bounded mappings f from a non-empty set S to a metric space (X,d). If 


fig € Bx(S), let do(f,g) = sup{d(f(s),g(s)) : s € S}. Then dy is a 
metric on Bx(S). 
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Proof First we show that d.(f,g) is finite. Let so € S. If s € S then, by 
the triangle inequality, 


A f(s), 9(s)) < a(F(s), F(s0)) + a(F(s0), 9(80)) + 4(g(s0), 9(s)) 
= OF, S) + d(f (so), 9(So)) + 24g, S). 
Taking the supremum, 
doo(f, 9) = OF, S) om d(f (so), 9(sSo)) =P X49, S) <&. 


Conditions (i) and (iii) are clearly satisfied, and it remains to establish the 
triangle inequality. Suppose that f,g,h € Bx(S) and that s € S. Then 


d( f(s), h(s)) < d(f(s), 9(s)) + d(g(s), R(s)) < doolf, 9g) + doolg, h). 
Taking the supremum, dxo(f,h) < do(f,g) + do(g,h). 


This metric is called the uniform metric. 
Example 11.1.12 Pseudometrics. 


We shall occasionally need to consider functions p for which the third 
condition in the definition of a metric is replaced by the weaker condition 

(3) at ¢ = y then p(2,y) = 0. 

In other words, we allow distinct points to be zero p-distance apart. Such 
a function is called a pseudometric. It is easy to relate a pseudometric to a 
metric on a quotient space. 


Proposition 11.1.13 Suppose that p is a pseudometric on a set X. The 
relation on X defined by setting x ~ y if d(x, y) = 0 is an equivalence relation 
on X. Let q be the quotient mapping from X onto the quotient space X/~. 
Then there exists a metric d on X/ ~ such that d(q(x),q(y)) = p(x, y) for 
ryEeEXx. 


Proof The fact that ~ is an equivalence relation on X is an immediate 
consequence of the symmetry property and the triangle inequality. We need 
a lemma, which will be useful elsewhere. 


Lemma 11.1.14 Suppose that p is a pseudometric, or a metric, on a set 
X, and that a,a’,b,b' € X. Then 


|p(a, b) — p(a’,b')| < p(a,a’) + p(b, 0’). 
Proof Using the triangle inequality twice, 


p(a, b) < p(a,a’) + p(a’,b) < p(a,a’) + p(a’,’) + p(b', b), 
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so that 


pa, b) -_ pla’, b’) = p(a, a’) + pb, b'). 


Similarly 


which gives the result. 


pla’, b") > pla, b) < pla, a’) + po, b'), 


We now return to the proof of the proposition. If a ~ a’ and b ~ Db! 
then it follows from the lemma that p(a,b) = p(a’,b’). Thus if we define 
d(q(a),q(b)) = p(a,b), this is well-defined. Symmetry and the triangle 
inequality for d now follow immediately from the corresponding proper- 
ties of p. Finally if d(q(a),q(b)) = 0 then p(a,b) = 0, so that a ~ b and 


q(a) = q(0). 


We shall meet more examples of metric spaces later. 


11.1.1 


11.1.2 


11.1.3 


11.1.4 


11.1.5 


Exercises 


If x,y € (0,27), let d(z,y) = min(|z — y|, 2a — |x — y|). Show that 
d is a metric on [0,27). Define f : [0,27) + R? by setting f(x) = 
(cos z,sinaz). Let p(f(x), f(y)) = d(a,y). Show that p is a metric on 
f((0,27)), the arc-length metric. 

Suppose that p is a prime number. If r is a non-zero rational number, 
it can be written uniquely as r = p’(")s/t, where u(r) € Z and s/t is 
a fraction in lowest terms, with neither s nor ¢t having p as a divisor. 
Thus if p = 3 then v(6/7) = 1 and v(5/18) = —2. Let n(r) = po. 
If r,r’ € Q, set dp(r,r’) = n(r — 1’) for r £ r’ and d,(r,7r’) = 0 if 
r =r’. Show that d is a metric on Q. This metric, the p-adic metric, 
is useful in number theory, but we shall not consider it further. 

As far as I know, the next example is just a curiosity. Consider R@ 
with its usual metric d. If « = ay for some a € R (that is, x and 
y lie on a real straight line through the origin) set p(x, y) = d(x, y); 
otherwise, set p(z,y) = d(xz,0) + d(0,y). Show that p is a metric 
on R4. 

Let P, be the power set of {1,...,n}; the set of subsets of {1,...,n}. 
Let h(A, B) = |AAB|, where AAB is the symmetric difference of A 
and B. Show that h is a metric on P,, (the Hamming metric). 

Let P(N) be the set of subsets of N. If A and B are distinct subsets 
of N, let d(A, B) = 2-4, where j = inf(AAB), and let d(A, A) = 0. 
Show that d is a metric on P(N) and that 


d(A,C’) < max(d(A, B),d(B,C)) for A,B,C € P(N). 
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11.1.6 Let f be the real-valued function on the extended real line R defined 
by f(a) = a/V1+4+ 2? if « € R, f(+oo) = 1 and f(—oo) = —1. If 
x,y ER, let d(x,y) =|f(x) — f(y)|. Show that d is a metric on R. 
Show that a sequence (a,,)°°., of real numbers converges to +00 as 
n — oo if and only if d(a,, +00) > 0 as n > co. 


11.2 Normed spaces 


Many of the metric spaces that we shall consider are real or complex vector 
spaces, and it is natural to consider metrics which relate to the algebraic 
structure. We shall assume knowledge of the basic algebraic properties of vec- 
tor spaces and linear mappings; these are described in Appendix B. Suppose 
that E is a real or complex vector space. It is then natural to consider those 
metrics d which are translation-invariant: that is, d(x + a,y +a) = d(x,y) 
for all z,y,a € E. Note that this implies that 


d(z,y) = d(x — y,0) = d(0,x — y) =d(—a, —y). 


It is also natural to require that if we multiply by a scalar then the distance 
is scaled in an appropriate way: d(Az, Ay) = |Ald(x,y) for all x,y € E and 
all scalars A. It is easy to characterize such metrics. 

A real-valued function x — ||z|| on a real or complex vector space E is a 
norm if 


1. lla + yl] < [le|| + llyll (p is subadditive), 
2. |lax|| = |a| ||z|| for every scalar a and 
3. ||x|| = 0 if and only if s =0, 


for a a scalar and x,y vectors in E. (EF, ||.||) is then called a normed space. 
Note that ||xz|| = |/—a|| and that |/0|| = |/0.0|] = 0|/O|| = 0. A norm is 
necessarily non-negative, since 0 = ||0|| < ||z|| + ||—2|| = 2 ||z]]. 

A subset C of areal or complex vector space is convex if whenever x,y € C 
and 0 <¢t <1 then (1—t)x+ty € C. A real-valued function on a convex 
subset C' of a real or complex vector space EF is convex if whenever x,y € C 
and 0<t< 1 then 


f(A t)a + ty) < (1 —t) f(a) + tf(y). 


Proposition 11.2.1 /f||.|| is a norm on a real or complex vector space E, 
then ||.|| 7s a convex function on E. 


Proof Suppose that 7,y € FE and0Q<t< 1. Then 


(1 — t)a + tyl] < I]. — all + [ltyll = — 4) Ilel| +e lly. 
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Corollary 11.2.2 The sets U = {x: ||x|| < 1} and B= {z: ||x|| < 1} are 
convex subsets of E. 


Theorem 11.2.3 Suppose that d is a metric on a real or complex vector 
space ;. Then the following are equivalent: 

(i) d is translation-invariant and satisfies d(Ax,Ay) = |A\d(x,y) for all 
x,y € E and all scalars 4; 

(it) there exists a norm ||.|| on E such that d(x,y) = ||xz — y]]. 


Proof If d is a translation-invariant metric with the desired scaling 
properties, and we set ||z|| = d(x,0), then ||z|| = 0 if and only if « = 0, 


= d(x, 0) + d(y,0) = ||x|| + Ilyll, 


and 
||Ax|| = d(Ax, 0) = d(Ax, 0) = |Ald(a, 0) = |AI |||] - 


Thus (i) implies (ii). 
Conversely, suppose that ||.|| is a norm on F, and that we set d(z,y) = 
|| — y||. First we show that d is a metric on E: 


d(a,y) = |v — yl = lly — #]| = dy, 2), 


d(x, y) = 0 if and only if ||” — y|| = 0, if and only if « — y = 0, if and only if 
x= y, and 


d(x, z) = |x — 2|| = |(@-y) + y- 2) 
< ||z — yll + lly — zll = d(x, y) + dy, 2), 


so that the triangle inequality holds. Further, 
d(x + a,y + a) = ||(x + a4) — (y+a)|| = |x — yl] = d(z, y) 
so that d is translation invariant, and 


d(Aa, Ay) = ||Aw — Ayl| = ||A@ — yl = [Al la — yll = Ald@, y). 


Thus (ii) implies (i). 


A vector x in a normed space (F, ||.||) with ||a|| = 1 is called a unit vector. 
If y is a non-zero vector in EF, then y = Ay, where A = |ly|| and yj; is the 
unit vector y/ |ly|l. 
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Let us give some examples of norms. 


Theorem 11.2.4 The function ||z||, = Os mi)? is a norm on R4 
(the Euclidean norm). 


Proof — Clearly ||x||, = 0 if and only if « = 0 and ||Az||, = |A| ||z||,. Let d 
be the Euclidean metric on R%. Then, as in Corollary 11.1.4, 


Iz + ylly = d(a + y,0) < d(w, 0) + dy, 0) = lal + Ilylla; 


so that ||.||, is subadditive. 
In the same way, we have the following. 


Theorem 11.2.5 The function ||z||, = ae |z;|?)1/2 is a norm on C4, 


The norm ||.||, on C4 is again called the Euclidean norm. 
We shall generalize these two examples in the next section. 
Suppose that (£, ||.||) is a normed space. Then, since 


lla — yll < lal + IIyll and |lyll S [ly — all + lel, 


a subset B of E is bounded if and only if sup{||b|| : b € B} < 00; we say 
that B is norm bounded, or bounded in norm. 

Thus if S is a set and (EF, ]||.||) is a normed space, then the set Bg(S) = 
{f:S — E: f(S) is bounded} is equal to the set {f : S > EF: sup{||f(s)|| : 
s € S} < oo}. Further, Bg(S) is a vector space, when addition and scalar 
multiplication are defined pointwise: 


(f + 9)(s) = f(s) + g(s) and (Af)(s) = A(F(s))- 


Arguing as in Proposition 11.1.11, 


II Flloo = doo(f, 0) = sup{]| f(s) 


is anorm on Bp(S), and dx(f,g) = ||f — gll,., for f,g € Be(S). The norm 
I|.||, is called the uniform norm. We denote the normed space (Bz(S), ||.||,,) 
by 1(S,E). When E = R or C, and the context is clear, we write 
loo(S) for loo(S,R) or loo(S,C). We denote Iao(N) (or loo(ZT)) by loo, and 
lsc tl yssagthl) by TE. 

As with pseudometrics, we occasionally need to consider functions on a 


:s ES}, 


vector space which do not satisfy the third condition for a norm. A seminorm 
p on a vector space F is a real-valued function which satisfied the first two 
conditions for a norm, but not necessarily the third. Note that, as for a 
norm, p(x) = p(—2), p(0) = 0 and p(x) > 0 for all x € E. 
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Proposition 11.2.6 If p is a seminorm on a vector space E’, then the set 
{x € E: p(x) = 0} is a linear subspace N of E. 


Proof If X is a scalar and z,y € N then p(Ax) = |Alp(x) = O and 
0<p(a+y) < p(x) + p(y) =0. 


If p is aseminorm on a vector space EF, then the function 7(z, y) = p(x—y) 


is a pseudometric on EF. If ~ is the equivalence relation which this defines, 
then x ~ y if and only if x — y € N. Thus the quotient space E’/ ~ is the 
quotient vector space E/N, and q is a linear mapping of F onto E/N. If we 
set ||q(x)|| = p(x), then ||.|| is a norm on E/N which defines the metric of 
Proposition 11.1.13. 


Exercises 


11.2.1 Suppose that f is a real-valued function on a convex subset C' of a real 
or complex vector space E. Show that the following are equivalent. 
(a) f is a convex function on C. 

(b) The set {(c,t) € Cx R: f(c) < t} is a convex subset of E x R. 
(c) The set {(c,t) €C x R: f(c) < t} is a convex subset of E x R. 

11.2.2 If x = (21,...,24) € R4, let ||z||, = ae |x;|. Show that |].||; is a 
norm on R4. The normed space (R4%, |j.||,) is denoted by 1¢(R). Prove 
a similar result in the complex case. 

11.2.3 Let 11(R) denote the set of real sequences (a,,)°2_, for which }7°°., |an| 
is finite. Show that 1,(R) is a real vector space (with the algebraic 
operations defined pointwise) and that the function |la||, = }>?-., |an| 
is a norm on 1,(R). 


11.3 Inner-product spaces 


In 1885, Hermann Schwarz gave another proof of Cauchy’s inequality, this 
time for two-dimensional integrals. Schwarz’s proof is quite different from 
Cauchy’s, and extends to a more general and more abstract setting. This 
provides some important examples of normed spaces. 

Suppose that V is a real vector space. An inner product on V is a real- 
valued function (x,y) > (x,y) on V x V which satisfies the following: 

(i) (bilinearity) 


(a4 21 + Q2%9,y) = a4 (@1,y) + a2 (2, y) , 
(x, B1y1 + Boye) = 61 (@, y1) + Bo (x, yo), 


for all x, 21,22, y, 41, yo in V and all real a1, a2, 31, Bo; 
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(ii) (symmetry) 
(ya) ={e,9) forall 29m V; 
(iii) (positive definiteness) 
(x, x) > 0 for all non-zero x in V. 


For example, if V = R%, we define the usual inner product, by setting 
(2,0) = Sy e0,; for 2 = (27), = (w,). 

A function which satisfies (i) and (ii) is called a symmetric bilinear form. 

Similarly, an inner product on a complex vector space V is a function 
(x,y) > (x,y) from V x V to the complex numbers C which satisfies the 
following: 

(i’) (sesquilinearity) 


(0121 + a2%2,y) = a1 (21, y) + a2 (22,Y) , 

(x, Bry + Boya) = Bi (x, y1) + Bo (x, ya) , 
for all x, 21,22, y,Y1, y2 in V and all complex ay, a2, 31, 32 (note that com- 
plex conjugation is applied to the second term; theoretical physicists do it 


the other way round); 
(ii’) (the Hermitian condition) 


(y,2) = (a, y) for all x,y in V; 
(iii!) (positive definiteness) 
(x,x) > 0 for all non-zero x in V. 


2): 


A function which satisfies (i’) and (ii’) is called a Hermitian bilinear form. 
Note that it follows from (ii’) that 


For example, if V = C%, we define the usual inner product, by setting 
Ct) = ee zw; for z = (z%),w = (w,). As another example, the space 
C|a, b] of continuous (real or) complex functions on the closed interval [a, }| 
is an inner-product space when the inner product is defined by 


b 
oe iy Pasay ae 


A (real or) complex vector space V equipped with an inner product is 


1 
called an inner-product space. If x is a vector in V, we set ||x|| = (x, x) 2. We 
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shall show that ||.|| is a norm on V. Certainly ||z|| = 0 if and only if « = 0, 
and ||Az|| = |A| [|x|]. 

In what follows, we shall consider the complex case: the real case is easier, 
since we do not need to consider complex conjugation. 


Proposition 11.3.1 (The Cauchy—Schwarz inequality) Jf x and y are 
vectors in an inner-product space V then 


[(e,y) |S [lell - lly, 


with equality if and only if x and y are linearly dependent. 


Proof ‘This depends upon the quadratic nature of the inner product. The 
inequality is trivially true if (x, y) = 0. If ||x|| = 0, then z = 0 and (x,y) = 0, 
so that the inequality is true, and the same holds if ||y|| = 0. 

Otherwise, if A € C then 


0 < |la + Ayl? = (a + Ay, x + Ay) 
= (x,x) +X (x,y) +r {y, 2) + |A?? (yy) - 


Put 
_ (x,y) [zl 
L(z,y) | [yl 


It follows that 


0 sj? — LOE Nel MEA yy? = 2 (a? —| xp LL) 
< Le.) lull Wyle 


so that | (x,¥)| < |||] - lly. 
If « = 0 or y = 0, then equality holds, and x and y are linearly dependent. 
Otherwise, if equality holds, then ||z + Ay|| = 0, so that r+ Ay = 0, and x 
and y are linearly dependent. Conversely, if x and y are linearly dependent, 
then « = ay for some scalar a, and so 


2 
| (v,¥) | = lal iyi” = Tall - lly 


Note that we obtain Cauchy’s inequality by applying this result to R% or 
C?¢, with their usual inner products. 


Corollary 11.3.2 ||x+-y|| < ||z|| + |ly||, with equality if and only if either 
y=0 orx=ay, witha> 0. 
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Proof We have 


2 2 2 
Ila + yl = Mell” + ey) + (y, &) + Ilyll 
2 2D: 
< |lell’ +2 [lel] yl + lyll’ = (ell + Ilyll)?. 


Equality holds if and only if R(x, y) = ||x||. |ly|], which is equivalent to the 
condition stated. 


Thus ||.|| is a norm on V. 
Note also that the inner product is determined by the norm: in the real 
case, we have the polarization formulae 


2 2 2 2 2 
(z,y) = (lla + yl? — lel — lly) = 20le + 9lP - We — 9), 
and in the complex case we have the polarization formula 
(v9) =a {Dl #||2+eyl | 
j=0 
We also have the following. 


Proposition 11.3.3 (The parallelogram law) Jf x and y are vectors in an 
inner-product space V, then 


Iz + yll? + lle — yll? = lial? + 2llyll?. 


Proof For 


lle + yll? + lle — yll? = (a, 2) + (x,y) + (y, 2) + yy) + 
+((0, 2) = (9,4) = (Ya) yy) 
= 2|lal|? + 2\lyl|?. 


Many of the geometric and metric properties of inner-product spaces can 
be expressed in terms of orthogonality. Vectors « and y in an inner-product 
space V are said to be orthogonal if (x,y) = 0; if so, we write x_Ly. For real 
spaces, this property can be expressed metrically, in terms of the norm. 


Proposition 11.3.4 Jf ax andy are vectors in a real inner-product space 
V then «Ly if and only éf ||x + y||? = |lx||? + |lyl|*. 


Proof For ||x + yl|? = |lall? + llyll? + 2 (2,9). 
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On the other hand, if x is a vector in a complex inner-product space, 
lz + éx||? = 2 |al|? = ||x||? + lléa||?, while (x, ix) = —é|[a?. 
If A is a subset of an inner-product space V, we set 


At ={ €V: (a,x) =0 for all a € A}. 


We write a+ for {v}+. A+ is the annihilator of A: it has the following 
properties. 


Proposition 11.3.5 Suppose that A and B are subsets of an inner-product 
space V. 


1. A> ={x EV: (z,a) =0 for allae A}. 
2. A~ is a linear subspace of V. 

3. If AC B then B+ C At. 

4, AC A+, 
5 
6 


Aa =AS 
. ANA~ = {0}. 


Proof ‘These all follow easily from the definitions. For example, to prove 5, 
At c (A+)++, by (iv), while (At+)+ C A+, by (iii), since A C At. 


Suppose that x is a unit vector in V, and that z € V. Let A = (z, 2) and 
let y = z—Az. Then (y, x) = (z,x)—(z, x) (x, 2) = 0. Thus z = Ax+y, where 
Ag € span(x) and y € xt. If z = px + w, with w € xt, then (z,2) = p, so 
that p = A and w = y; the decomposition is unique. 


Exercises 


11.3.1 Let Iz denote the set of real sequences (a,,)°2, for which 37°C, |an|? 
is finite. Show that [2 is a vector space (with the algebraic operations 
defined pointwise), that if a,b € lg then \°*°_, anbn converges abso- 
lutely, and that the function (a,b) + (a,b) = )°°°, anbn is an inner 
product on lg. 

11.3.2 Establish corresponding results for complex sequences. 

11.3.3 Let x, y and z be elements of a real inner-product space, such that 
|x — z|| = ||x — y|| + ||y — z||. Show that there exists 0 < A < 1 such 
that y = (1—A)a + Xz. 

11.3.4 A pre-inner-product space is a vector space £ with a symmetric bilin- 
ear (Hermitian sesquilinear) form (.,.) which is positive semi-definite: 
(c,z) > 0 for all ¢ € FE. Show that N = {zr € E: (2,2) =O} isa 


11.4 Euclidean and unitary spaces 317 


linear subspace of E, and that if g: E + E/N is the quotient map- 
ping then there exists an inner product (.,.),y on E/N such that 


(q(x), a(¥)) vy = (2, y), for zy € E. 


11.4 Euclidean and unitary spaces 


We now restrict attention to finite-dimensional spaces; a finite-dimensional 
real inner-product space is called a Euclidean space and a finite-dimensional 
complex inner-product space is called a unitary space. Throughout this 
section, V will denote a Euclidean or unitary space of dimension n. The 
key idea is that of Gram-—Schmidt orthonormalization. 


Theorem 11.4.1 Suppose that (x1,...,vq) is a basis for a Euclidean 
or unitary space V. Then there exists a basis (e1,...,ea) for V with the 
following properties. 

(a) \|heg|| = 1 for l= 9 <d and (e;,e;) = 0 fori <1< 7 <d. 

(a) Lf W; = span (15.4 62; ), then W; = span (6i;..«;4;), for 1 <9 sad. 
Proof The proof is by an iterative construction. We set e, = 21/ ||x1||. 
Suppose that we have constructed e€),...,¢j;-1, satisfying the conclusions of 
the theorem. Let fj = xj — eS (29, Gj) ej: Since By € Wea, J; = 0. Let 
ej = fj/\|fjl]- Then |le;|] = 1, and 


span (€1,...e;) = span (W;_1, e;) = span (Wj_1, 2;) = Wj. 


Thus (€1,...,€n) is a basis for W;. If 1 << k < j then 


j-1 
(tee) = (Deven) -Si( (72) (Cis Gk) = (25yeR) = (es@h) = 0, 
i=1 


so that (e;,e,) = 0. Thus (e1,...,e;) is a basis for W;. In particular, 
(e€1,..-,€q) is a basis for V with the required properties. 


The construction made in the proof is known as Gram—Schmidt orthonor- 
malization. A basis (e1,...,eq) which satisfies condition (i) of the theorem is 
called an orthonormal bias More generally, if (e1,...,¢e,) is a sequence in an 
inner-product space V which satisfies condition (i) (with k replacing d), then 
(€1,.--,€k) is called an orthonormal sequence. Note that if (e1,...,e,) is an 
orthonormal sequence and ee, xje; = 0 then x; = os 1j¢3,€:) = 0 


for 1 <i <k; an orthonormal sequence of vectors is linearly independent. 
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If (e1,...,@q) is an orthonormal basis for V, and « = 5 


j=1 2jej € V then 
(o,@;) =a, for 1<i1<d, so that 


Thus if x,y € V then 


d d 
(t,y) = » (x, €;) (ej, 4) = Ds (x, €3) (ys €3)- 


In particular, 
d 
2 
lll? = 2 | (x, ey) P. 
j=l 


Corollary 11.4.2 JfW is a k-dimensional linear subspace of a Euclidean 
or unitary space V thenV =W@W+4, and there exists an orthonormal basis 
(€1,---,€q) of V such that (e1,...,e,%) is a basis for W and (ex41,...,€q) is 
a basis for Wt. 


Proof Let (a1,...,2%) be a basis for W. Extend it to a basis 
(a1,...,@q) for V, and apply Gram—Schmidt orthonormalization to obtain 
an orthonormal basis (e1,...,ea) for V. Then (e1,...,e%) is an orthonor- 
mal basis for W, and span (ep41,-..e€a) C Wt. On the other hand, if 
ore (z,e;)e; € W+ then (z,e;) = 0 for 1 < j < k, so that z = 
Bere (x, ej) ej € span (ex41,-..€q). Thus (e,41,-..,€q) is an orthonormal 
basis for W+. Since WM W+ = {0}, it follows that V =-WoWt. 


If « € V we can write x uniquely as y + z, with y € W and z € Wt. Let 
us set Py (x) = y. Py is a linear mapping of V onto W, and P?, = Py. Pw 
is called the orthogonal projection of V onto W. Note that Pyyi = I — Py. 

Although it is easy, the next result is important. It shows that an ortho- 
gonal projection is a ‘nearest point’ mapping; since it is linear, it relates the 
linear structure to metric properties. 


Proposition 11.4.3 If W is a linear subspace of a Euclidean or unitary 
space V and x € V then Py(zx) is the nearest point in W to x, and is the 
unique point in W with this property: ||x — Pw(x)|| < ||a — w|| for w € W, 


and if ||z — Pw(x)|| = || — w|| then w = Py (za). 
Proof Let (e1,...,¢€q¢) be an orthonormal basis for V which satisfies the 
conditions of Corollary 11.4.2. If a= ea jeg EV andw= y= we; CW 
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then 
k d 
2 2 2 
lz — wl? => olay — wl? + SO lal, 
j=l j=k+1 
and this is minimized if and only if w; = 2; for 1 < 7 < k, in which case 


a= Para). 


We shall extend Corollary 11.4.2 and Proposition 11.4.3 to certain inner 


product spaces in Section 14.3. 


Exercise 


11.4.1 Suppose that (21,...,2%,) is a basis for a Euclidean or unitary 
space V, and that (e),...,en) and (fi,..., fn) satisfy the conclu- 
sions of Theorem 11.4.1. Show that there are scalars \1,...,An of 
unit modulus such that f; = A;e; for 1 <j <n. 


11.5 Isometries 
A mapping f from a metric space (X,d) to a metric space (Y,p) is an 
isometry if it preserves distances: that is, if p( f(x), f(a’)) = d(«, 2’) for all 
x,v' EX. 

If T is a linear mapping from a normed space (£,]||.||,;) into a normed 
space (F, |].||,-) which is an isometry, then T is called a linear isometry. The 
mapping T is an isometry if and only if ||T’(x)||- = ||z||_¢ for all « € E. The 
condition is necessary, since 


IT@)lle = IT@) -TOlle = le -— Olle = Ilelle- 


It is sufficient, since 


T(z) -— Tolle = IP @ - lle = lle — glle- 


An isometry preserves the metric geometry of (X,d). Let us give some 
examples. 


Example 11.5.1 An isometry of N into R. 


Let N = N U {oo} be given the metric p defined in Example 11.1.9 in 
Section 11.1. Let f(n) =1/n and let f(oo) =0. Then f is an isometry of N 
into R, with its usual metric. 


Example 11.5.2 The mapping (x,y) > 2 + iy is a linear isometry of R? 
onto C, when C is considered as a real vector space. 
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Example 11.5.3 The conjugation mapping. 


The conjugation mapping z — Z is an isometry of C onto itself. It is a 
linear mapping when C is considered as a real vector space, but is not linear 
when C is considered as a complex vector space 


Example 11.5.4 Rotations of R?. 


Since 


(x cost — ysint)* + (zsint + ycost)* = 274+ y’, 
the linear mapping r; from R? > R? defined by 


ri(x,y) = (acost — ysint, xsint + ycost) 


is a linear isometry of R? onto R?. It is a rotation of R?. It is a bijection, 
with inverse r_;. 


Example 11.5.5 Translations of a normed space. 


Suppose that (F,||.||) is a normed space. If a € E, let Ty(x) = «+a; Ty 
is a translation. It is an isometry of (£,||.||) onto itself, since 


\|Za() — Ta(y)ll = I + @) — (y+ all = lle — gl 


Example 11.5.6 If (£,||.||) is a normed space, and \ is a scalar with 
|A| = 1 then the mapping x — Az is a linear isometry of F onto itself. 


Example 11.5.7 A linear isometry of 1?(R) onto /2,(R). 


If x,y € R then max(|x + y|,|z — y|) = |x| +|y|. Thus the linear mapping 
T : 2(R) > 12,(R) defined by T((z,y)) = (w@ + y,x — y) is an isometry of 
I2(R) onto /2,(R). 


Example 11.5.8 Reflections of a real inner-product space. 


Suppose that x is a non-zero vector in a real inner-product space V. If 
z € V, we can write z uniquely as z = Ax + y, where \ € Rand y € ct. Let 
px(Z) = —Ax + y, so that pz(x) = —a and p,(z) = z if and only if z € z+. 
Then pz is a linear mapping of V onto V, and is an involution: p2 is the 
identity mapping. It is an isometry, since 


2 2 2 2 
llec(z)||" =» lla? + [lvl = [lel 


It is called the simple reflection in the direction x, with mirror «+. 
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Suppose that x and y are distinct vectors in a real inner-product space 
V, with ||z|| = ||y||. Then 


(x +y,0—y) = (x, 2) — (x,y) + (yz) — (yy) = |lx\|? — llyll? = 9, 


so that (x + y)L(x — y). Thus 


[i] ot 
8 

+ 
og 
—— 
II 
NIK 
— 
< 

| 
ue 
+ 
NIK 
— 
8 

+ 
k= 
wa 
II 

= 


Pu—y(£) = Pu—y(5(@ — y)) + Pa—yl 
and pr—y(y) = x. 
Example 11.5.9 Linear isometries between inner-product spaces. 


Proposition 11.5.10 Suppose that S: V > W is a linear mapping from 
a real inner-product space V to a real inner-product space W. Then S is an 
isometry if and only if (S(x), S(y)) = (x,y) forz,yeV. 


Proof If S is an isometry, then 


(IS(@)I? + IS@)IP — Se) — S()IP) 
((|2 11? + llyll? — lle — yl?) = (ay). 


(S(x), S(y)) = 


i 
2 
1 
2 


The condition is sufficient, since ||S(x)|| = \/ (2, 2), 


Thus if (e1,..., ex) is an orthonormal sequence in V and S is an isometry 
then (S(e1),...,S(e,)) is an orthonormal sequence in W. 


Corollary 11.5.11 Jf (e1,...,eq) is an orthonormal basis for a Euclidean 
space V and T is a linear mapping from V into an inner-product space W 
then T is an isometry if and only if (T(e1),...,T(eq)) 1s an orthonormal 
sequence in W. 


Proof As we have just observed, the condition is necessary. If it is satisfied 


and x = i xje; € V then 
d . d 
2 2 
IZ(@)P = So aT) = 0 23 = lel. 
j=l j= 
If (e1,...,€,) is an orthonormal sequence in a real (or complex) inner- 


product space V, then the mapping T from R* (or C*) into V defined by 
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Deh = TF (hin x 5h) ) = ae xje; is an isometry, since 
k 
2 
IZ@)IPF = So lei? = Ile. 
j=l 


Any two Euclidean spaces of the same dimension are linearly isometric: if 
V and W are Euclidean spaces of dimension k, then there exists a linear map- 
ping of V onto W which is an isometry. Let (e),...,e,) be an orthonormal 
basis for V, and let (f1,..., f;) be an orthonormal basis for W. Let 
J(ayey +--+ + open) = ti fi +--+ + oe fe. 


Then J is a linear isometry of V onto W. 
Example 11.5.12 An isometry of a metric space (X,d) into I.(X). 


This example will be useful to us later. Let (X,d) be a metric space, with 
X non-empty, and let 1,.(X) = 1..(X,R) be the normed space of bounded 
real-valued functions on X introduced in Section 11.2. Let 29 be an element 
of X. If x € X, let 


fe(y) = d(@,y) — d(wo, y) for y € X. 

Since d(xo,y) < d(xo,x) + d(w,y) and d(x,y) < d(xo,7) + d(xo,y), by the 
triangle inequality, it follows that |f,(y)| < d(ao,2), so that fr € lno(X), 
and || fz||,, < d(vo,7). We claim that the mapping x > fz : X — I.(X) is 
an isometry. Since 
= d(x, y) - d(x’, y) = d(x, a’), 

= d(a’,y) _ d(x, y) = aaa) 
it follows that |fc(y) — fe(y)| < d(a,2’) for all y © X. Hence 
Il fc — fe’ \|o, < d(x, x"). On the other hand, 

fe(a') — fo (a’) = d(a, 2’) — d(a’,2') = d(z, 2’), 


and so ||fe— fell, = (0,2 ). 
Two metric spaces (X,d) and (Y,p) are said to be congruent if there is 


an isometry of X onto Y. (They are said to be similar if there exists a > 0 
and a mapping f of X onto Y such that p(f(zx), f(2’)) = ad(z, z’).) 

The composition of two isometries is an isometry, and the inverse of a 
bijective isometry is an isometry. Thus the set of bijective isometries of a 
metric space (X,d) onto itself forms a group under composition, the group 
of metric symmetries of (X,d). This group gives valuable information about 
the metric space. 
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Exercises 
Let I? denote R4 with norm ||z||, = Ee, EAE 


11.5.1 If A € Pj, let [4 be its indicator function: [4(j) = 1 if 7 € A and 
I4(j) = 0 otherwise. Show that the mapping A —> I, is an isometry 
of Py, with its Hamming metric, into ie. 

11.5.2 Let ej = (0,...0,1,0,...0), with 1 in the jth place. 

(a) Show that if 2 is a unit vector in I? and if max(|j¢+yl,, 
|x — y||,) > 1 for all y #0 then x = +e; for some 1 < j < d. 

(b) Let f be an isometry of 1? with f(0) = 0. Show that there exists 
a permutation o of {1,...,d} and a choice of signs (e1,...,€q) 
(that is, «; = +1 for 1 < j < d) such that f(ej) = cjeg(jy, 
f(-ej) = -€j@oG) for 1 <j <d. 

(c) Show that f is linear, so that f(x) = ae €j£;€q(;) for x € I. 

11.5.3 By considering vectors of the form (€1,...,€q), where ¢; = +1 for 


1 < j <d, show that if a mapping f of /¢, into itself is an isometry 
and if f(0) = 0 then there exists a permutation o of {1,...,d} and 


a choice of signs (€1,...,€q) (that is, €; = +1 for 1 < j < d) such 
that f(e;) = €j;€o73), f(—e;) = —€j€o(j) for 1 < j < d. Show that f is 
linear. 


11.6 *The Mazur—Ulam theorem* 


(This section can be omitted on a first reading.) 


Suppose that (£,||.||;) and (F,||.||) are real normed spaces and that 
J: E— F isan isometry. Let L = T_j(9)0 J, where T_ (9) is the translation 
of F mapping J(0) to 0. Thus L(x) = J(x)—J(0), so that L is an isometry of 
E into F,, with L(0) = 0. Our principal aim is to show that if L is surjective, 
then it must be linear. This extends the results of Exercises 11.5.2 and 11.5.3 
of the previous section. 


Theorem 11.6.1 (The Mazur—Ulam theorem) Jf L: E > F is an isome- 
try of a real normed space (E,||.||,,) onto a real normed space (F,||.||-) with 
L(0) = 0, then L is a linear mapping. 


In order to prove this, we introduce some ideas concerning the geometry 
of metric spaces, of interest in their own right. First, suppose that x, y, z are 
elements of a metric space. We say that y is between x and z if d(a,y) + 
d(y,z) = d(x, z), and we say that y is halfway between x and z if d(x, y) = 
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d(y,z) = s(x, z). We denote the set of points halfway between x and z by 
H(a,z). The set H(z, z) may be 


e empty, for example if (X,d) has the discrete metric, 

e a singleton set, for example if (X,d) is the real line R, with the usual 
metric, when H(z, z) = {5(x + z)}, 

e or may contain more than one point: let (X,d) = /2,(R), x = (—1,0), and 
2= (1,0); then A(g, 2) =4(0,9)2-1L< 9 < 1}. 


If (E,||.||;) is a normed space, then $(x + z) € H(x,z), but H(x,z) may 
contain other points, as the last example shows. The set H(z, z) is always 
bounded, since if y, y’ € H(x,z) then d(y,y’) < d(y,x) +d(z,y’) = d(z, z). 

Suppose that A is a bounded subset of a metric space (X,d). Can we 
find a special point in A which is the centre of A, in some metric sense? 
In general, the answer must be ‘no’, since, for example, in a metric space 
with the discrete metric, there is no obvious special point. In certain cases, 
however, the answer is ‘yes’. First, let 


K(A) = {x € A: d(x,y) < $diam (A) for all y € A}; 


k(A) is the central core of A. Again, k(A) may be empty, may consist of one 
point (which would then be the centre of A) or may consist of more than 
one point; for example, if 


A={(z,y) €R(R):-1<a<1,-5 <y< $5}, 


then K(A) = {(0,y) : -5 < y < $}. Note though that diam(«(A)) < 
diam A. This suggests that we iterate the procedure: we set «1(A) = K(A), 
and if k,(A) 4 0 we set kn41(A) = K(K,(A)). There are then three possible 
outcomes: 


e K,(A) = 0 for some n € N; 
e Kn(A) 4 for all n EN, but MP2, (Kn(A)) = 9; 
e Kkn(A) #O@ for all n EN, and N°, (Kn(A)) FO. 


If either of the first two cases occurs, then A does not have a centre. In the 
third case, diam MP2, (Kn(A)) < diamk,(A) < diam (A)/2”, for alln EN, 
so that diam M°°, (%n(A)) = 0, and NP°,kn(A) = {c(A)}, a singleton set. 
Then we call c(A) the centre of A. 

Let us give an example. A subset A of a real vector space F is 
symmetric if A = —A: that is, if x € A then —2 € A. 


Proposition 11.6.2 Jf A is a bounded symmetric subset of a normed space 
(E,||.||) and if 0 € A then 0 is the centre of A. 
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Proof Let us consider «(A). We show that «(A) is symmetric and that 
0 € K(A). First, if y € «(A) and x € A then 


d(—y, x) = ||-y — 2|| = ly + 2 = lly — (-2)|| = dy, -2) < Zdiam (A), 
so that —y € K(A) and «(A) is symmetric. Secondly, if x € A then 
d(x, 0) = ||x|| = 5 ||2z|| = 3 llz — (—x)|| = gd(a, —2) < 5diam (A), 


so that 0 € «(A). We can therefore iterate the procedure: K,(A) is 
symmetric, and 0 € &,(A), and so it follows that 0 is the centre of A. 


Corollary 11.6.3 If x,z € E then $(x + 2) is the centre of H(a, z). 


Proof First consider the case where z = —x. Then 0 € H(x,—x), and if 
y € H(z,—z) then 


Il(-9) — ell = lly - 2) = 5 Ilz - (-2)|| = lle). 


Similarly, ||(—y) — (—x)|| = 5 || — (—2)||. Thus H(z, —2) is symmetric, and 
0 is its centre. In the general case, let y = 3(a + z). Since translation is an 
isometry, 


A(x, 2) = Ty(H (a — y,2—y)) = Ty(A(« — y,-(@ — y))), 


so that y = T,(0) is the centre of H(z, z). 


The importance of this is that the centre is defined purely in terms of the 
metric, and not in terms of the vector space structure of EF. 


Proof of Theorem 11.6.1. Note that if A is a bounded subset of EF, with 
centre c(A), then, since L is a surjective isometry, L(A) is a bounded subset 
of F’, with centre ane) = L(c(A)). If z,z € EB, then L($(x@ + z)) = 
5(L(x)+L(z)), since $(x+z) is the centre of H(z, z) and $(L(x)+L(z)) is the 
centre of H(L(x), L(z )), In particular, considering 2x a 0 Lie) = 5L (22), 
and considering 2x and 2z, L(x + z) = 5(L(2x) + L(2z)) = L(x) + L(z). 
Thus L is additive. 

From this, we deduce the fact that L(Ar) = AL(x), for X € R and 
xz € E, in a few easy stages. First, an easy induction argument shows that 
L((n+1)z) = L(nz)+L(2) = nL(x)+L(x) = (n+1)L(z), for n € N; thus the 
result holds for A € Zt. Secondly, L(nx) + L(—nx) = L(0) = 0, so that the 
result holds for A € Z. Thirdly, if m € Z and n € N then L((m/2"~!)x) = 
2L((m/2")x), so that L((m/2")x) = 5L((m/2"~!)z), and another induction 
argument shows that L((m/2")x) = (m/2")L(a). Thus the result holds for 
all dyadic rationals (numbers of the form m/2”, with m € Z and n € N). 
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Finally, suppose that A € R and that « > 0. Then there exists a dyadic 
rational r = m/2” such that |A — r| < €/2(||z||_- +1). Thus ||Av — ra||p = 
|A—r| ||2|| ~ < €/2. Since L is an isometry, ||L(Ax) — L(rz)|| 7 < €/2, and also 


||P L(x) — AL(2)||p = |A—r||L(@) |p = |A- 7 [leIlp < €/2. 
Since L(rx) = rL(z), 
|LAx) — AL(2) |p 
< LAr) — L(raz)||p + ||L(rz) — rL(z)||p + ||\rL(@) — AL(2)||p <e. 


Since this holds for all € > 0, ||Z(Av) — AL(2)||~> = 0, and so L(Ar) = 
AL (a). 


The condition that D is surjective cannot be dropped. It follows from the 
mean-value theorem that if « < y then there exists x < z < y such that 
sin xz — sin y = (a —y) cos z, so that | sina —siny| < y—a. Thus the mapping 
L:R- I[2,(R) defined by L(t) = (t,sint) is an isometry of R into 12,(R) 
with Z(0) = (0,0) which is clearly not linear. 

There is however one important circumstance in which the surjective 
condition can be dropped. A normed space (£,||.||~) is strictly convex if 
whenever z,y € E, ||z||,- = ||z||~ =1 and z # y then F(a + y)]|| <1, 


Proposition 11.6.4 If (E,||.||,,) is strictly conver and x,z € E then the 
set H(x,z) of points halfway between x and z is the singleton set {5(x+z)}. 


Proof First consider the case where z = —2, and ||z|| = 1. Ifu € H(a, —z) 
then ||z + ul] = ||z — u|| = ||z|| = 1. Since 
v= 3((e+u)+(%—u)), 


it follows from strict convexity that u = 0, so that H(x,—x) = {0}. Then, 
by scaling, H(a,—x) = {0}, for all 2 € E. Finally, H(x, z) = {5(x+2)}, by 
translation. 


Corollary 11.6.5 Jf Ll: E + F is an isometry of a real normed space 
(E,||.||_~) into a strictly conver real normed space (F,||.||,-) with L(0) = 0, 
then L is a linear mapping, and (E,||.||~) is also strictly conver. 


Proof If x,z € E, then $(x + z) is the centre of H(zx,z), and so 


3(L(a) + L(z)) must be the centre of H(L(z),L(z)) in L(E). But the 


only possible centre of H(L(x),L(z)) in L(E) is $(L(x) + L(z)). Thus 
L(s(@ + 2)) = $(L(x) + L(z)) and $(L() + L(z)) € L(Z). An argu- 
ment exactly like the one given in Theorem 11.6.1 then shows that 


D(a +z) = L(x) + D(z), and that L(Az) = AL (az), for alla € E and AER. 
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Thus L is an isometric linear mapping of F onto a linear subspace of F’. 
Since a linear subspace of a strictly convex normed space is strictly convex, 
it follows that (£,||.||,~) is strictly convex. 


Many important normed spaces are strictly convex, and indeed have 
stronger metric convexity properties. 


Proposition 11.6.6 An inner-product space E is strictly convex. 


Proof This follows easily from the parallelogram law. If ||x|| = ||y|| = 1 
and « # y then it follows from the parallelogram law that 


2 2 2 2 
4 = 2(lall" + |lyll") = lla + yl + lle — gl, 


so that ||3(e + y)||° =1-Flle—yl? <1, and |$(e+y)|| <1 


Corollary 11.6.7 Jf L: E > F is an isometry of a real normed space 
(E,||.||_~) into a real inner-product space (F;,||.||-) with L(O) = 0, then L is 
a linear mapping, and (E, ||.||,~) is also an inner-product space. 


Exercise 


11.6.1 Let J : R ~ I2(R) be defined as J(t) = $(¢ — 1/(? + 1), 
t+ 1/(t? + 1)). Show that J is an isometry. Why does this not 


contradict the Mazur—Ulam theorem? 


11.7 The orthogonal group Og 


We now consider the group Oj of linear isometries of R@, with its Euclidean 
metric; this is the orthogonal group, and its elements are called orthogonal 
mappings. As an example, a simple reflection p, in the direction x is an 
orthogonal mapping. It follows from the polarization formula that a linear 
mapping T is orthogonal if and only if (T(x), T(y)) = (2, y), for all 2, y € R4. 
Let (e1,...,€q) be the standard basis of R?. It then follows that a linear 
mapping S from R% to itself is orthogonal if and only if (S(e1),...,S(ea)) 
is also an orthogonal basis for R?. 

This can be expressed in terms of the matrix representing S. If 
S is represented by the matrix (s;;) in the usual way (so that 
se) = ae sije; for 1 <j < d), then S is orthogonal if and only if 


d d 
S sf, =1forl<j<d, and Seen = Wier 1 <7 bea. 
i=l i=1 
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Such a matrix is called an orthogonal matrix. 


Theorem 11.7.1 Suppose that T € Og. Then T can be written as the 
product of at most d simple reflections. 


Proof Let S = T —TJ. We prove the result by induction on the rank 
r(S) of S. We show that if r(S) = r then T is the product of at most 
r simple reflections. If r(S) = 0 then T = I, which is the product of no 
simple reflections. Suppose that the result holds if r(S) <r, where r < d. 
Suppose that T € Og and that r(S) =r-+1. Let N be the null-space of S: 
N ={x2 € R?: T(a) = a}. By the rank-nullity formula, dim(N) = d—r—1. 
Let x be a unit vector in N+, so that S(a) 4 0. We consider the simple 
reflection psgz). 

If y € N then 


(y, S(x)) = (y, T(x)) — (y, 2) = (L(y), P(x) — (y, 2) = 0, 
so that ps(2)(y) = y- Also 


(S(x), T(a) + #) = (T(a) — #, T(x) + 2) 
= (T(x), T(a)) — (@,T(a)) + (T(@), «) — (#, @) = 0, 


so that T(x) + x € (S(x))+. Hence pgi)(T(x) + x) = T(x) + 2. But 
Ps(a) (I(x) — x) = —T(x) + 2, and so pgyq)(T(x)) = x. Let U = psig) oT. 
Then U € Og, U(x) = x and U(y) = y for y € N. Let M = span (N, 2), so 
that dim (M) = d-—r. Then (U —1)(z) =0 for z € M, and sor(U—T) <r. 
By the inductive hypothesis, U is the product of at most r simple reflections. 
Since T’ = pg) 0U, T is the product of at most r+ 1 simple reflections. 


Here are some more easy examples. Suppose that o is a permutation of 
{1,...,n}. If ¢ © R? let T(x) = (PeQjss~-4hoey)> Then Ty € Og; it 13 a 
permutation operator. Note that T>' = T,-1. Suppose that 0 < t < 27. If 
x € R4, let 


R(x) = (a1 cost — xg sint, x, sint + x2 cost, x3,...,Xa). 
Then R; € Og; it is an elementary rotation. Note that pi ae = Ron_t. 


Theorem 11.7.2. For d > 2, let Gq be the subgroup of Og generated by 
the permutation operators and the elementary rotations. Then Gq = Og. 


Proof We leave this as an exercise for the reader. 


11.7.1 


11.7.2 


11.7.3 


11.7.4 
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Exercises 


Let 7%),; € En be the permutation of {1,...,n} which transposes i and 
a: T(t) = 9, T1459) = 1 and G7(k) = k otherwise. Show that &,, is 
generated by the transpositions {71,; :2 <j < n}. 

Suppose that T € Og, and that r(T — I) = r. Show that T cannot be 
written as the product of fewer than r simple reflections. 

Interpret the equation (T(x)+2,T(x)—x) = 0 that occurs in 
Theorem 11.7.1 geometrically. 

Prove Theorem 11.7.2. 
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Convergence, continuity and topology 


12.1 Convergence of sequences in a metric space 


We now turn to analysis on metric spaces. The definitions and results that 
we shall consider are straightforward generalizations of the corresponding 
definitions and results for the real line. The same is true of the proofs; in 
most cases, they will be completely straightforward modifications of proofs 
of results in Volume I. We shall however present the material in a slightly 
different order. 

Suppose that (X,d) is a metric space, that (a,,)°°, is a sequence of ele- 
ments of X, and that 1 © X. We say that a, converges to 1, or tends to l, 
as n tends to infinity, and write a, > 1 as n — oo, if whenever € > 0 there 
exists Ng (which usually depends on €) such that d(a,,1) < € for n > no. In 
other words, the real-valued sequence (d(ay,/!))°°., tends to 0 as n > oo. 

Suppose that « € X and e > 0. The open e-neighbourhood N,(x) is defined 
to be the set of all elements of X distant less than € from a: 


N(x) = {y € X : d(y,z) < e}. 


We can express convergence in terms of open e-neighbourhoods: 
Gn — las n — oo if and only if for each « > O there exists ng such that 
an € N-(l) for n > no. 

We have the following consequence of Lemma 11.1.14. 


Proposition 12.1.1 Jf a,— 1 and b, > m as n > oo then 


d(an, On) + d(l,m) as n > oo. 
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Proof For |d(an, bn) — d(l,m)| < d(an,l) + d(bp,m), and 


d(an,l) + d(bn,m) 4 0 as n > ov. 


When they exist, limits are unique. 


Corollary 12.1.2 Ifa, ~ l asn—- © anda, > m as n > ~, then 
l=m. 


Proof Put b, = Gy. Then d(an,b,) = 0, so that d(l,m) = 0 and 
L=m. 


A subsequence of a convergent sequence converges to the same limit. 


Proposition 12.1.3 Ifa, > 1 asn— oo and if (an, )P.9 18 @ subsequence, 
then an, > 1 as k > oo. 


Proof Given « > 0 there exists N such that d(an,l) < € forn > N, 
and there exists kg such that n, > N for k > ko. Thus if k > ko then 
G(Ganst) 


Let us give two examples. First, suppose that 


(2g = ((2y",-- 2g )o 
is a sequence in R%, and that « = (x1,...,rq) € R%. We consider the 


Euclidean norm |j.||, and Euclidean metric on R4. If 1 < j < d then [a _ 


or Bes |x — a 9» 80 that if a) + x asn— co then 2” + Lj aS N—> Oo. 


j 
Conversely, suppose that a" > 2; a8n— oo forl<j<d. Givene > 0 
and 1 < j <d there exists nj € N such that [ed = iG < e/Vd for 7. > m;. 
Let N = max{n;:1<j<d}.Ifn>N then 


so that 2 + x as n > oo. Thus a sequence in R@ converges in 
the Euclidean metric if and only if each sequence of coordinates con- 
verges: convergence in the Euclidean metric is the same as coordinate-wise 
convergence. 

The second example is extremely important. Suppose that S is a set, that 
(X,d) is a metric space, that (fn)?29 is a sequence in the space Bx(S) 
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of bounded functions on S taking values in X, and that f € Bx(S). We 
consider the uniform metric dx on Bx(S). If s € S, then 


AU fn(s), F(8)) < sup{d(fn(t), F(t) + t © S} = doo(fns f); 


so that if f, — f in the uniform metric as n > co then f,(x) > f(x) as 
n— oo: fr, > f pointwise. But the convergence is stronger than that: given 
€ > O there exists no such that 


doo (fn, f) = sup{|fn(s) — f(s)|:s € S} <e for n > no. 


Thus there exists an ng independent of s such that |f,(s) — f(s)| < € for 
all n > no and all s € S. We say that f, > f uniformly on X as n > oo. 
The distinction between uniform convergence and pointwise convergence is 
most important. It is reassuring that the uniform convergence of bounded 
functions can be characterized in terms of a metric (and in terms of a norm, 
when X is a normed space). 

It is however useful to have a slightly more general definition. Suppose 
that (fn)°2, is a sequence of functions on S, taking values in (X,d), and 
that f is a function on S with values in X. Then we say that f, converges 
uniformly on S to f as n — oo if supjeg d(fn(t), f(t)) 4 0 as n — oo; in 
other words, we do not restrict attention to functions bounded on 3S. 

Let us give some easy but important results about uniform convergence. 
We shall generalize them later. 


Theorem 12.1.4 I[f (fn)°, is a sequence of continuous real-valued func- 
tions on a subset A of R, and if fy, converges uniformly on A to f asn— oo, 
then f is continuous. 


Proof Suppose that to € A and that « > 0. There exists N € N such that 
|fn(t) — f(t)| < €/3 for allt € A and n > N. Since fy is continuous at to, 
there exists 6 > 0 such that if t € A and |t —to| < 6 then |fy(t) — fn (to)| < 
e/3. If t € A and |t — to| < 6 then 


lf (t) — f(to)| < |f®) — fr(®| + |fn() — fx (to)| + | fn (to) — f(to)| <, 


so that f is continuous at to. 


Theorem 12.1.5 Suppose that (fn)°, is a sequence of Riemann inte- 
grable functions on a bounded interval [a,b] and that f, — f uniformly. 
Then f is Riemann integrable, and i fn(x) dx > - f(x) dz asn— co. 


Proof We use the following criterion, established in Corollary 8.3.5 of 
Volume I. 
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Lemma 12.1.6 <A bounded function f on an interval [a,b] is Rie- 
mann integrable if and only if given € > 0 there exists a dissection 
D={a=20 <--: < ry = 0} of [a,b] and a partition GUB of {1,...,k} 
such that 
Q(f,1;) <e forjeG and S15) <e 
jeEB 
where I,,...I, are the intervals of the dissection, and 


Q(f, 15) = sup |f(x) — fly)| 


xyel; 
is the oscillation of f on I;. 


First we show that f is bounded. There exists N € N such that 
doo(f, fn) < 1. If t € [a,b] then 


IFO) SIF) — fr) + [fv] < 1+ sup |frv(s)I, 


s€[a,b] 


so that f is bounded. 

Next we show that f is Riemann integrable. Suppose that « > 0. There 
exists M € N such that da(f, fn) < €/3 for n > M. By the lemma, there 
exists a dissection D = {a = to < --: < x, = 6} of [a,b] and a partition 
GUB of {1,..:,k} such that 


OQ(fu,1I;) <¢€/3 forj EG and SoU) <€ 
jEB 


where Jj,...J; are the intervals of the dissection. If 7 € G and s,t € J; then 
If(s) — F@®| < f(s) — far(s)| + |fre(s) — far (t)| + fur) — FO] <e, 


so that Q(f,1;) < €. Thus f satisfies the conditions of the lemma and so it 
is Riemann integrable. If n > M then 


[10 ape [ wt atl < [iso- (t)| dt < e(b— a), 


so that [” at )dt > f° f(t) t) dt as n > oo. 


Corollary 12.1.7 Suppose that (fn)°, is a sequence of continuously dif- 
ferentiable real-valued functions on an open interval (a,b) of R, and that the 
sequence (f!)°°, of derivatives converges uniformly on (a,b) tog asn + oo. 


Suppose also that there exists c € (a,b) such that fr(c) > | asn > co. Then 
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there exists a continuously differentiable function f on (a,b) such that fr 
converges uniformly on (a,b) to f as n— oo, and such that f' = g. 


Proof Ift € (a,b), let f(t) = 0+ le g(s) ds. Since g is continuous, by 
Theorem 12.1.4, we can apply the fundamental theorem of calculus: f is 
differentiable, with derivative g. Suppose that «€ > 0. There exists N € N 
such that 


lfn(e) — F(e)| < 5 and |fn(s) — 9(s)| < 
for n > N and s € (a,b). If t € (a,b) then 


f(t) — falt) = (FQ — fal) + | (f4(s) — g(s)) ds, 


so that ifn > N then 


so that f,, converges uniformly on (a,b) to f as n > ov. 


These proofs are rather easy. If we drop the continuity conditions, the 
proofs are considerably harder. 


Theorem 12.1.8 Suppose that (f,,)?2, is a sequence of differentiable real- 
valued functions on a bounded open interval (a,b). Suppose that 

(a) there exists c € (a,b) such that fn(c) converges, to | say, as 
n— oo, and 

(b) the sequence (f/,)°°, of derivatives converges uniformly on (a,b) to a 
function g. 

Then there exists a continuous function f on (a,b) such that 

(t) fn > f uniformly on (a,b), and 

(ii) f is differentiable on (a,b), and f'(x) = g(x) for all x € (a,b). 


Proof Suppose that € > 0. There exists N € N such that 


lfm (®) — fn()| 


€ 


< 26a) for x € (a,b) and m,n>N, 
and |fim(c) — fr(c)| < €/2 for m,n > N. By the mean-value theorem, 


I(fm(x) — fr(@)) — (fm(©) — fn())| S$ |e — e| sup |fin(y) — fr(w)l < 5 


yE[c,a] 
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for m,n > N, and so 
\fm(x) — fn(x)| < € for x € (a,b) and m,n > N. 


Thus, for each x € (a,b), (fm(x))°°_, is a Cauchy sequence, convergent to 
f(x), say, and 


f(x) — fn(x)| < € for x € (a,b) andn>N: 


fn 2 f uniformly on (a,b). Thus f is a continuous function on [a,b], by 
Theorem 12.1.4. 
Suppose that x € (a,b), that h £0 and that 2 +h € (a,b). Let 


dn(h) = fr(a +h) — fr(x) forn EN, 
dh) = f(a +h) — f(a); 
then d,(h) > d(h) as n > oo. If m > N then, by the mean-value theorem, 


ldm(h) —dy(h)| <|h| sup |fin(y) — fn(y)| < €lhl/4, 
yE|x,a+h] 


so that 


(dn (h) — hfm(2)) — (dn (h) — hfin(2))| 
< |din(h) — dn (h)| + [RU Fin (2) — Fv (2) 
< elh|/4 + e|h|/4 = €lh|/2. 
Letting m — ov, 
|(d(h) — hg(x)) — (dn (h) — hf (2))| < €|h{/2. 


But there exists 6 > 0 such that (a—6,2+06) C (a,b) and such that if |h| < 6 
then 


ldn(h) — hfn(x)| = |fn(a +h) — f(x) — hfn(2)| < €lh|/2, 
and so 


|d(h) — hg(@)| = |fla +h) — f(@) — hg(a)| < ¢|h| for |h] < 6. 


Thus f is differentiable at x, with derivative g(x). 


In the case where d is a metric on a vector space F (and in particular, 
when d is given by a norm), we can also consider the convergence of series. 
Suppose that (an,)P29 is a sequence in E. Let sp, = i=0 a;, for n € N, and 
suppose that s € E. Then the sum )°° 9 an converges to s if 5, + 8 as 
n> oc. 
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Exercises 


Suppose that a set S is given the discrete metric d. Show that a 
sequence (x,,)°°., converges to a point of S if and only if it is even- 
tually constant; there exists N € N such that x, = x,y for all 
n> WN. 

Suppose that (a,,)°2, is a sequence in a metric space which has the 
property that if (yx)? = (@n,)g21 is a subsequence of (#,,)°2., then 
there is a subsequence (2; )721 = (Ym, )F21 of (yx )Z21 which converges 
to 71. Show that 7, > 7, asn > ow. 

Suppose that (#,,)°°, and (y,)°2, are sequences in a normed space 
and that a and £ are scalars. Show that if )°°°., 2, converges to 
s and )>°_, yn converges to t then S°°°_, (ax, + Byn) converges to 
as + Ct. 

Suppose that }°°°., x, is a convergent series in a normed space. 
Show that ||z,,|| > 0 as n > oo. 

Give an example of a sequence (f,,)?°., of continuous real-valued 
functions on [0,1] which converges pointwise to a continuous 
function f on [0,1], but which does not converge uniformly to f. 
Give an example of a sequence (f,,)?2°., of continuous real-valued 
functions on [0,1] which decreases pointwise to a bounded function 
f on [0,1], but which does not converge uniformly to f. 

Give an example of a sequence (f,,)°°., of bounded continuous real- 
valued functions on [0,00) which decreases pointwise to a bounded 
continuous function f on [0,00), but which does not converge 
uniformly to f. 

Let h be the hat function: 


2t if0<t<4, 
A(t)=¢ 2-2 if$<t<1, 
0 otherwise. 
Let 
n Det : 
i a=] 
halt) = oz | doh (2 (:- i) 
k=1 j=l 


Sketch hy, hg and hg. Show that h, converges pointwise to 0, 
but that there exists no proper interval [c,d] in [0,1] on which it 
converges uniformly to 0. 

Formulate versions of Theorems 12.1.8 and 12.1.5 for infinite sums 
of functions. 
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12.1.10 Show that Theorem 12.1.5 does not hold for improper integrals over 
[0, co). 


12.2 Convergence and continuity of mappings 


Suppose that A is a subset of a metric space (X,d). An element b of X is 
called a limit point or accumulation point of A if whenever € > 0 there exists 
a € A (which may depend upon ¢) with 0 < d(a,b) < ¢. Thus 6 is a limit 
point of A if there are points of A, different from b, which are arbitrarily 
close to b. An element a of A is an isolated point of A if it is not a limit 
point of A; that is, there exists « > 0 such that N.(a) 1 A = {a}. 

If a € X and € > 0 then the punctured e-neighbourhood N*(a) of a is 
defined as 


Nila)={e-e X 20 < de) < e} = Na) \ fal. 


Thus 0 is a limit point of A if and only if N*(b) 1 A 40, for each € > 0. 

Suppose that (X,d) and (Y, ) are metric spaces and that f is a mapping 
from a subset A of X into Y. Suppose that b is a limit point of A (which may 
or may not be an element of A) and that | © Y. We say that f(x) converges 
to a limit l, or tends to 1, as x tends to b if whenever € > 0 there exists 6 > 0 
(which usually depends on €) such that if « € A and 0 < d(z,b) < 6, then 
p(f(x),l) < «. That is to say, as x gets close to b, f(a) gets close to 1. Note 
that in the case where b € A, we do not consider the value of f(b), but only 
the values of f at points nearby. We say that / is the limit of f as x tends 
to b, write ‘f(x) alas x — B’ and write / = lim,_,, f(z). 

We can express the convergence of f in terms of punctured e-neigh- 
bourhoods; f(x) > I as x — b if and only if for each € > 0 there exists 6 > 0 
such that if € AN. N¥(b) then f(x) € N.(1) — that is, f(NF(b)N A) C N-(1). 


Proposition 12.2.1 Suppose that f is a mapping from a subset A of a 
metric space (X,d) into a metric space (Y, p), and that b is a limit point of A. 
(i) If f(x) 3 l asa —b and f(r) 7m asx), thenl =m. 
(ii) f(x) > 1 as « > b if and only if whenever (an)? is a sequence in 
A \ {b} which tends to b as n — co then f(an) 41 asn— co. 


Proof (i) Suppose that « > 0. There exists 6 > O such that if 

xz € N3(b) NA then p(f(x),1) < € and p(f(x),m) < €. Since N#(b) N A is 

not empty, there exists x) € Ns(b) A. Then, using the triangle inequality, 
(l,m) < p(l, f(ao)) + plf(ao),m) <¢+e= 26 

Since this holds for all « > 0, p(l,m) =0 andl =m. 
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(ii) Suppose that f(x) > | as « — b and that (ap)?29 is a sequence in 
A \ {b} which tends to b as n — oo. Given € > 0, there exists 6 > 0 such 
that if « € Ns(b)N A then p(f(x),l) < ¢€. There then exists ng such that 
d(an,b) < 6 for n > no. Then p(f(an), 1) < € for n > no, so that f(an) > | 
as 1. —> 00. 

Suppose that f(a) does not converge to | as x — b. Then there exists 
€ > 0 for which we can find no suitable 6 > 0. Thus for each n € N there 
exists Zn € NF in(®) OA with p(f(xp), 1) > e. Then z, > b as n > co and 
f (an) does not converge to 1 as n — oo. 


Suppose now that f is a mapping from a metric space (X, d) into a metric 
space (Y,p), and that a € X. We say that f is continuous at a if whenever 
€ > 0 there exists 6 > 0 (which usually depends on ¢) such that if d(x,a) < 6 
then p(f(x), f(a)) < ¢. That is to say, as x gets close to a, f(x) gets close 
to f(a). If f is not continuous at a, we say that f has a discontinuity at a. 

We can express the continuity of f in terms of e-neighbourhoods; f is 
continuous at a if and only if for each « > 0 there exists 6 > 0 such that 
f(No(2)) © Ne(f(a)). 

Compare this definition with the definition of convergence. First, we only 
consider functions defined on X. This is not a real restriction; suppose that 
f is a mapping from a subset A of a metric space (X,d) into a metric space 
(Y,), and that a € A. We say that f is continuous on A at aif f: A> Y 
is continuous at a when A is given the subspace metric. Secondly, a need 
not be a limit point of X. If it is a limit point, then f is continuous at a if 
and only if f(z) > f(a) as x > a. Ifa is not a limit point, then there exists 
56 > 0 such that N5(a) = {a}, so that if ¢ € Ns(a) then f(x) = f(a), and f 
is continuous at a. 

We have the following immediate consequence of Proposition 12.2.1. 


Proposition 12.2.2 Suppose that f is a mapping from a metric space 
(X,d) into a metric space (Y,p), and that a € X. Then f is continuous at 
a if and only if whenever (an)?2.9 is a sequence in X which tends to a as 
n— oo then f(an) > f(a) asn > cw. 


Suppose that f is a real- or complex-valued function on a metric space 
(X, d). Unless it is explicitly stated otherwise, when we consider convergence 
and continuity properties of f, we give R or C its usual metric. 


Theorem 12.2.3. Suppose that f and g are functions on a metric space 
(X,d) taking values in a normed space (E,||.||;,), that A is a scalar-valued 
function on (X,d) and thatae€ X. 
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(i) If f is continuous at a then there exists 5 > 0 such that f is bounded 
on N;(a). 

(ii) If f(a) =0, f ts continuous at a, and X(x) is bounded on N;(a) for 
some 6 > 0, then Af is continuous at a. 

(iit) If f and g are continuous at a then f + g is continuous at a. 

(iv) If f and X are continuous at a then Af is continuous at a. 

(vu) If Xx) # 0 for x € X, and if X is continuous at a, then 1/A is 
continuous at a. 


Proof These results correspond closely to results for functions of a real 
variable (Volume I, Theorem 6.3.1). We prove (i), (iv) and (v), and leave 
the others as exercises for the reader. 

(i) There exists 6 > 0 such that || f(x) — f(a@)||_7 < 1 for « € Ns(a). Then 


IF@lle < If@) -—f@Olle + Ff Olle S1+1F@lle, for x € Ns(a). 


(iv) Suppose that € > 0. Let M = max(||f(a)||,,|A(@)|), and let 
n = min(e/(2M + 1),1). There exists 6 > 0 such that if x € N;(a) 
then ||(f(x) — f(@)|l_z < 7 and |A(x) — A(a)| < . If « € N5(a), then 


IF(@ le SNF@)- FOlle +I F(Olle <1 +M, so that 


A(z) F(e) — A(a) F(a) le = AC) — Ala) F(@) + A@(F(2) — F@)lle 
< |A(z) — A(@)]. TF @)le + AI F(@) — F@Ile 
< n(n +M)+Mn<e. 


(v) Suppose that « > 0. Let 7 = |A(a)|?e/2. There exists 6 > 0 such 
that |A(a) — A(a)| < max(|A(a)|/2,7) for « € Ng(a). If « € No(a), then 
|A(x)| > |A(a)|/2, and so 


Meine). 


_ pe — Xe) 


Proposition 12.2.4 (The sandwich principle) Suppose that f, g and h are 
real-valued functions on a metric space (X,d), and that there exists n > 0 
such that f(a) < g(x) < h(x) for all x € N,(a), and that f(a) = g(a) = 
h(a). If f and h are continuous at a, then so is g. 


340 Convergence, continuity and topology 


Proof This follows easily from the fact that 


\g(x) — g(a)| < max(| f(x) — f(a)], |h(@) — h(a). 


Continuity behaves well under composition. 


Theorem 12.2.5 Suppose that f is a mapping from a metric space (X, d) 
into a metric space (Y,p) and that g is a mapping from Y into a metric 
space (Z,a). If f is continuous ata € A and g is continuous at f(a), then 
gof is continuous at a. 


Proof Suppose that € > 0. Then there exists 7 > 0 such that 


9(Nn(f(@))) S Ne(9(F(a)))- 


Similarly there exists 6 > 0 such that f(Ns(a)) C N,(f(a)). Then 
g(F(No(a))) S 9(Nn(F(@))) S Ne(9(F(a)))- 


This proof is almost trivial: the result has great theoretical importance 


and practical usefulness. 

Continuity is a local phenomenon. Nevertheless, there are many important 
cases where f is continuous at every point of X. In this case we say that f 
is continuous on X, or, more simply, that f is continuous. Let us give some 
easy examples. 


1. An isometry from a metric space (X,d) into a metric space (Y,p) is 
continuous on X: given € > 0, take 6 =e. 

2. In particular, if A is a subset of a metric space (X,d) and A is given the 
subspace metric, then the inclusion mapping 7: A + X is continuous. If 
f : (X,d) > (Y,p) is continuous, then so is the restriction foi: A> Y 
of f to A. 

3. More generally, if f is a mapping from a metric space (X,d) to a metric 
space (Y,p) and if x € X then f is a Lipschitz mapping, with constant K, 
at x if p( f(x), f(a’)) < Kd(a, 2’) for all 2’ € X. If there exists K > 0 such 
that p(f(x), f(2’)) < Kd(a,2’), for all 2,2’ € X, then f is a Lipschitz 
mapping on X, with constant K. A Lipschitz mapping at x is continuous 
at x (given € > 0, take 6 = €/K). 

4. If f is a constant mapping from a metric space (X,d) into a metric space 
(Y,p) — that is, f(x) = f(y) for any x,y € X — then f is continuous: 
given « > 0, any 6 > 0 will do. 

5. If d is the discrete metric on a set X then every point of X is isolated, 
and any mapping f : (X,d) —> (Y,p) is continuous. 


6. 


10. 
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Suppose that A is a non-empty subset of a metric space (X,d). If x € X, 
let d(x, A) = inf{d(z,a) : a € A}. The mapping x — d(a, A) is a mapping 
from X to R. We show that it is a Lipschitz mapping with constant 1, 
and is therefore continuous. Suppose that x,y € X. If « > 0, there exists 
a € A with d(a,a) < d(x, A) +e. Then 


d(y, A) < d(y,a) < d(y, x) +d(z,a) < d(y,z) + d(x, A) +. 


Since € is arbitrary, d(y, A)— d(x, A) < d(y, x). In the same way, d(x, A)— 
d(y, A) < d(x,y) = d(y,x), and so |d(x, A) — d(y, A)| < d(z,y). 


. In particular, if (£, ||.||) is a normed space, then the mapping x — ||z|| is 


a Lipschitz mapping from EF to R with constant 1. For ||2|| = d(x, {0}). 


. The function tan is a continuous bijection from (—7/2, 7/2) onto R, when 


both are given the usual metric, and the inverse mapping tan~! is also 
continuous. A bijective continuous mapping f from a metric space (X, d) 
onto a metric space (Y,p) whose inverse is also continuous is called a 
homeomorphism. 


. We can give R@ the Euclidean metric dz. We can also consider R@ as 


B({1,2,...,d}), and give it the uniform metric d... Then 


1/2 
d / 


doo(x,y) = max |; — yjl S So lay — wl? 
Bae = 


= do(z,y) < d/7doo(a,y), 


so that the identity mapping i : (R%7,d2) > (R%,d..) is a homeomor- 
phism. If p; and p2 are two metrics on a set X for which the identity 
mapping 7 : (X, 1) > (X, 2) is a homeomorphism, then the metrics are 
said to be equivalent. If, as in the present case, i and i~! are Lipschitz 
mappings, then the metrics are said to be Lipschitz equivalent. 

Let C[0,1] be the vector space of (real or) complex continuous functions 
on [0,1]. We can give C[0, 1] the uniform metric: do(f,g) = sup{|f(x) — 
g(x)| : « € [0,1]}. We can also give it the metric defined by the inner 
product: do(f,g) = cn |f(x) — g(x)|?,dx)'/2. Since do(f,9) < doo(f,9), 
the identity mapping i : (C[0,1],dx.) — (C[0,1],d2) is a Lipschitz 
mapping, with constant 1. On the other hand, if we set f,(x) = 2”, 
then [frilly = de(fn,0) = 1/(2n + 1)¥?, so that do(fp,0) > 0 as 
nm —> oo, while ||frll,, = doo(fn,0) = 1, so that f, does not con- 
verge to 0 in the uniform metric as n — oo. Thus the inverse mapping 
i-! : (C[0, 1], dz) > (C0, 1], d..) is not continuous. 


We have the following generalization of Theorem 12.1.4. 
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Theorem 12.2.6 Jf (fn)°21 is a sequence of continuous functions from a 
metric space (X,d) into a metric space (Y,p) and if fr, converges uniformly 
on X to f asn— oo, then f is continuous. 


Proof Verify that a proof is given by making obvious notational changes 
to the proof of Theorem 12.1.4. 


Exercises 


12.2.1 Show that any two metrics on a finite set are Lipschitz equivalent. 

12.2.2 Let (N,p) be the metric space defined in Example 11.1.9. Suppose 
that (%,)°2, is a sequence in a metric space (X,d), and that x € X. 
Set f(n) = Zn, f(+00) = x. Show that z, — z as n > oo if and only 
if f : (N, p) > (X,d) is continuous. 

12.2.3 Suppose that (X,d), (Y,e) and (Z,c) are metric spaces, that f is 
a continuous surjective mapping of (X,d) onto (Y,p) and that g : 
(Y, p) > (Z,c) is continuous. Show that if go f is a homeomorphism 
of (X,d) onto (Z,o) then f is a homeomorphism of (X, d) onto (Y, p) 
and g is a homeomorphism of (Y,p) onto (Z,o). 

12.2.4 Show that the punctured unit sphere {x¢R%: ||z|| =1} \ 
{(1,0,...,0)} of R%, with its usual metric, is homeomorphic to R&!. 

12.2.5 Give an example of three metric subspaces A, B and C’' of R such 
that A C B CC, A and C are homeomorphic, and B and C are not 
homeomorphic. 


12.3 The topology of a metric space 


This section contains many definitions: we start with a few. 

Suppose that A is a subset of a metric space (X,d). Recall that a point 
b€ X isa limit point, or accumulation point, of A if and only if Ni(b)N.A # 
0, for each € > 0. We now make another definition, similar enough to be 
confusing. An element b of X is called a closure point of A if N-(b)N AF 9, 
for each « > 0. That is to say, whenever € > 0 there exists a € A (which 
may depend upon e€) with d(b,a) < «. Thus 0 is a closure point of A if there 
are points of A arbitrarily close to b. If b € A, then b is a closure point of A, 
since d(b,b) = 0 < « for alle > 0. 

The set of limit points of A is called the derived set of A, and is denoted 
by A’, and the set of closure points of A is called the closure of A, and is 
denoted by A. A set A is perfect if A = A’ and is closed if A= A. 

An element a of A is an isolated point of A if there exists « > 0 such that 
N.(a) 1A = {a}. 
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Proposition 12.3.1 Suppose that A is a subset of a metric space (X,d). 
Let i(A) be the set of isolated points of A. Then A’ and i(A) are disjoint, 
and A= A’ Ui(A). 


Proof ‘This follows immediately from the definitions. 


The set A is a subset of A, since each point of A is a closure point of A, 
but any isolated point of A is not in A’. A set is perfect if and only if it is 
closed, and has no isolated points. 

We can characterize limit points and closure points of A in terms of 
convergent sequences. 


Proposition 12.3.2 Suppose that A is a subset of a metric space (X,d) 
and that be X. 

(i) b is a limit point of A if and only if there exists a sequence (a;)j-, in 
A\ {b} such that a; — b as j + oo. 

(ii) b is a closure point of A if and only if there exists a sequence (a;) 
in A such that aj — b as j + oo. 


oo 
j=1 


Proof (i) Suppose that there exists a sequence (a;)729 in A\ {b} such that 
a; —> b as j + oo. Suppose that € > 0. There exists jg such that d(b, a;) < € 
for j > jo. Then a;, € N*(b). Conversely, if b is a limit point of A then for 
each j € N there exists a; € A \ {b} with 0 < d(b,a;) < 1/j. Then a; > b 
as J > Co. 

(ii) The proof is exactly similar. 


Proposition 12.3.2 (ii) says that A is closed if and only if A is closed under 
taking limits. 
The closure of a bounded set is bounded. 


Proposition 12.3.3 Jf A is a non-empty bounded subset of a metric space 
(X,d), then diam A = diam A. 


Proof Certainly diam A > diam A. Suppose that € > 0. If x,y € A there 
exist a,b € A with d(x,a) < €/2 and d(y,b) < €/2. Then, by the triangle 
inequality, 


d(x,y) < d(x,a) + d(a,b) + d(b,y) < d(a,b) +e < diamA-+e, 


so that diam A < diam A + e. Since € is arbitrary, the result follows. 


A subset A of a metric space (X,d) is dense in X if A = X. For example, 
the rationals are dense in R. 
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Proposition 12.3.4 Suppose that A and B are subsets of X. 

(i) If AC B then ACB. 

(ii) A is closed. 

(iii) A is the smallest closed set containing A: if C is closed and A C C 
then ACC. 


Proof (i) follows trivially from the definition of closure. 
(ii) Suppose that b is a closure point of A and suppose that ¢ > 0. Then 

there exists c € A such that d(b,c) < €/2, and there exists a € A with 

d(c,a) < €/2. Thus d(b, a) < e, by the triangle inequality, and so b € A. 
(iii) By (Gi), AC C=C. 


Suppose that Y is a metric subspace of a metric space (X,d). How are 
the closed subsets of Y related to the closed subsets of X? 


Theorem 12.3.5 Suppose that Y is a metric subspace of a metric space 
(X,d) and that A CY. Let A’ denote the closure of A in Y, and AX the 
closure in X. 

(i) A =A* ny. 

(it) A is closed in Y if and only if there exists a closed set B in X such 
that A= BNY. 


Proof (i) Certainly a C a, so that A” Cc A* AY. On the other hand, 
ify € A* Y there exists a sequence (d,,)72., in A such that an > y as 
n — oo. Thus y € qa. 

(ii) If A is closed in Y, then A = A ee ph Y, so that we can take 
Bear. Conversely if B is closed in X and A= BNY, then A C B, so 
that _ HF ay eBay =A and A is closed in Y. 


Here are some fundamental properties of the collection of closed subsets 
of a metric space (X, d). 


Proposition 12.3.6 (i) The empty set 0 and X are closed. 

(ii) If A is a set of closed subsets of X then N4c AA is closed. 

(itt) If {Ay,..., An} is a finite set of closed subsets of X then UF Ay is 
closed. 


Proof (i) The empty set is closed, since there is nothing to go wrong, and 
X is trivially closed. 

(ii) Suppose that 6 is a closure point of N4c4A, and that A € A. Ife > 0 
then there exists a € N4e4A with d(b,a) < «. But then a € A. Since this 
holds for all « > 0, b € A = A. Since this holds for all A € A, b € N4caA, 
and so N4¢.4A is closed. 
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(iii) Suppose that 6 ¢ Uf_,Aj. Then for each j, b ¢ Aj = 
so there exists €; > 0 such that if d(b,c) < ¢; then c ¢ Aj. Let 
e = min{e;: 1 <j <n}. Then ce > 0 and if d(b,c) < € then c ¢ Uf_, Aj. 
Thus 0 is not a closure point of Us Aj, and so Up A; is closed. 


Here is an important example. 


Theorem 12.3.7 Suppose that (X,d) and (Y,p) are metric spaces. Let 
Cy(Y,X) denote the set of all bounded continuous mappings of Y into X. 
Then Cy(Y, X) is a closed subset of the space Bx (Y) of all bounded mappings 
of Y into X, when Bx(Y) is given the uniform metric doo. 


Proof If f is in the closure of C,(Y, X), then, by Proposition 12.3.2 there 
exists a sequence (f,)°72, in C,(Y,X) which converges uniformly to f. It 


then follows from Theorem 12.2.6 that f is continuous. 


We now introduce some more definitions. Suppose that A is a subset of a 
metric space (X,d). 


e An element a of A is an interior point of A if there exists €« > 0 such that 
N.(a) C A. In other words, all the points sufficiently close to a are in A; 
we can move a little way from a without leaving A. 

e The interior A®° of A is the set of interior points of A. 

e A subset U of X is open if U = U°. In other words U is open if and only 
if whenever u € U there exists € > 0 such that if d(u,v) < e€ then v € U. 
The collection of open subsets of (X,d) is called the topology of (X,d). 


Proposition 12.3.8 If (X,d) is a metric space, if x € X and ife > 0 
then the open €-neighbourhood N,(a) is open. 


Proof Suppose that y € N.(x), so that d(y,x) < «. Let 6 =e —d(y,z). If 
z € Ns(y) then, by the triangle inequality, 


d(z,x) < d(z,y)+d(y,x) <6+d(y,z) =e, 


so that Ns(y) C N(x). 


If (£,||.|]) is a normed space then the e-neighbourhood 
N,(0) = {x € E: ||z|| < e} 


is called the open e-ball; in particular, Ni(0) = {x € E: ||x|| < 1} is called 
the open unit ball. The scaling property of the norm implies that N.(0) = 
€N,(0). Although a normed space has plenty of closed linear subspaces, it 
has only one open linear subspace. 
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Proposition 12.3.9 Jf F is an open linear subspace of a normed space 
(E, ||.||) then F = E. 


Proof Since F is a linear subspace, 0 € F’. Since F is open, there exists 
€ > 0 such that N.(0) C F. Suppose that x € FE \ {0}; let y = ex/(||z|| + 1). 
Then 0 < |ly|| < €, so that y € N.(0), and so y € F. Since F is a linear 
subspace of E, x = (||x|| + 1)y/e € F. This is true for all x € E, so that 
F=E. 


‘Interior’ and ‘closure’, ‘open’ and ‘closed’, are closely related, as the next 
proposition shows. 


Proposition 12.3.10 Suppose that A and B are subsets of a metric space 
(X,d), and that C(A) = X \ A is the complement of A in X. 

(i) If AC B then A° C BY. 

(ii) C(A°) = C(A). 

(iti) A is open if and only if C(A) is closed. 

(iv) A° is open. 

(v) A®° is the largest open set contained in A: if U is open andU C A 
then U C A®. 


Proof — (i) follows directly from the definition. 
(ii) If b ¢ A° then N.(b) MN C(A) # @ for all « > 0, and so b € C(A). 
Conversely, if b € C(A) then N.(b) N C(A) 4 O for all € > 0, and so b ¢ A°. 
(iii) If A is open then C(A) = C(A°) = C(A), by (ii), and so C(A) is 
closed. If C(A) is closed then C(A°) = C(A) = C(A), so that A° = A. 
(iv) C(A°) = C(A) is closed, so that A®° is open, by (iii). 
(v) By G), UG =U? € A®, 


Corollary 12.3.11 Suppose that Y is a metric subspace of a metric space 
(X,d) and that AC Y. Then A is open in Y if and only if there exists an 
open set B in X such that A= BOY. 


Proof Take complements. 


Corollary 12.3.12 (i) The empty set 0 and X are open. 

(ii) If A is a set of open subsets of X then Usc aA is open. 

(itt) If {Ay,..., An} is a finite set of open subsets of X then MaiAy ts 
open. 


Proof Take complements. 


Two final definitions: if A is a subset of a metric space (X,d) then the 


frontier or boundary OA of A is the set A \ A°. Since 0A = ANC(A), OA is 
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closed. x € OA if and only if every open e-neighbourhood of 2 contains an 
element of A and an element of C(A). 

A metric space is separable if it has a countable dense subset. Thus R, 
with its usual metric, is a separable metric space. 

There are interesting metric spaces which are not separable: 


Proposition 12.3.13 Jf (X,d) is a metric space with at least two points 
and if S is an infinite set, then the space Bx(S) of bounded mappings from 
S—+> X, with the uniform metric, is not separable. 


Proof We use the fact that P(S) is uncountable; this was proved in Vol- 
ume I, Corollary 2.3.10. Suppose that x9 and x are distinct points of X, and 
let d = d(xo, 21). For each subset A of X, define the mapping f4:S > X 
by setting 

fa(s) =21 ifsc A and fa(s) =o if a ¢ A. 


Then f,4 is bounded. If A and B are distinct subsets of $, then there exists 
s € S such that s is in exactly one of A and B, and so dx(fa, fg) = d. 
Thus Naj2(fa) A Najo(fa) = 9. Suppose that G is a dense subset of Bx (S). 
Let H = {g € G: 9 € Nayo(fa) for some A € P(X)}. If g € H, then there 
exists a unique A € P(S) for which g € Ngyo(fa): let this be c(g). Then ¢ 
is a mapping of H into P(S). It is surjective, since if A € P(S) there exists 
g € G with dy(g, fa) < d/2, by the density of G, so that c(g) = A. Since 
P(S) is uncountable, so is H, and since H C G, G is uncountable. Thus 
Bx(S) is not separable. 


Exercises 


12.3.1 Show that a finite subset of a metric space (X,d) is closed. 

12.3.2 Suppose that (aj) Foy is a sequence in a metric space (X,d) which 
converges to a. Show that the set S = {a; : 7 € N} U {a} is closed. 

12.3.3 Give an example of an open e-neighbourhood N,(z) in a metric space 
(X,d) whose closure is not equal to M.(x) = {y € X : d(y, x) < ¢}. 

12.3.4 Let (£,_||.||) be a normed space. Let U = {x € E: ||x|| < 1}. Show 
that U = {x € E: ||z|| < 1}. 

12.3.5 Let D = {(i,j):1<i< Jj <d}. A real quadratic form on R4 is a 
function of the form qq(z) = 0 ,;,ep %jTiXj, Where a € RY. It is 
positive definite if qa(x) > 0 for all « 4 0. Show that 


{a € R® : qq is positive definite} 


is open in RP. 
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12.3.14 


12.3.15 


12.3.16 


12.3.17 


12.3.18 


12.3.19 


Convergence, continuity and topology 


Give an example of two subsets A and B of R, for which there exist 
continuous bijections f : A > B and g: B > A, but which are not 
homeomorphic. 
Show that the interior of the boundary of a subset of a metric space 
is empty. 
Suppose that (X,d) is a metric space and that a € X. Show that 
the following are equivalent 

(i) a is an isolated point of (X,d). 

(ii) {a} is open. 
(iii) Any real-valued function on X is continuous at a. 
(iv) If tz, — a as n — oo then there exists N € N such that z, =a 

forn > N. 

Suppose that A is a subset of a normed space (£,||.||), that « © E 
and that \ is a scalar. Show that z+ A= 2+A, that X\A = XA and 
that (1+ A)° = «+ A°. Under what circumstances is (AA)° = AA°? 
Show that A and A° are convex if A is convex, and that —A = —A 
and (—A)° = —A®. 
A collection B of open subsets of a metric space (X,d) is a base or 
basis for the topology if every open subset of X is a union of sets 
in B. Show that the collection of open intervals of R with rational 
endpoints is a basis for the usual topology of R. 
Suppose that (X,d) is a perfect metric space, and that S is a dense 
subset of X. Show that if F’ is a finite subset of S then S\ F' is dense 
in X. Show (using the axiom of dependent choice) that there exists 
an infinite subset J of S such that S \ J is dense in X. 
Show that a separable metric space has a countable basis for the 
topology. 
Show that a metric space with a countable basis for the topology is 
separable. 
Show that a metric subspace of a separable metric space is 
separable. 
Show that a set S, with the discrete metric, has a countable basis 
for the topology if and only if it is countable. 
Use the three preceding exercises to give another proof that if X is 
infinite then B(X), with the uniform metric, is not separable. 
Let co = {@ € ly : Zn 4 0 as n > oo}. Show that co is a separable 
closed linear subspace of I9. 
Suppose that UY is a set of open subsets of a separable metric space, 
any two of which are disjoint. Show that U is countable. 
Suppose that f is a real-valued function on a metric space (X,d). 
f has a strict local maximum at «x if there exists « > 0 such that if 
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0 < d(x,y) < € then f(y) < f(a). Show that if (X,d) is separa- 
ble then the set of strict local maxima is countable. (Consider a 
countable basis 6 for the topology, and consider the sets of 6 on 
which f is bounded above, and attains its supremum at a unique 
point). 

12.3.20 Suppose that f is a mapping from a metric space (X, d) into a metric 
space (Y,p). f has a removable discontinuity at a if a is a limit point 
of X, if 1 = lim,_,, f(x) exists, and 1 4 f(a). Show that if (X,d) is 
separable then f has only countably many removable discontinuities. 

12.3.21 A metric d, which like the p-adic metric, satisfies 


d(a,z) < max(d(x,y),d(y,z)) for x,y,z € X, 


is called an ultrametric. Verify that the p-adic metric on Q is an 
ultrametric. 

12.3.22 Show that if one considers the three distances between three points 
of an ultrametric space then either they are all equal or two are 
equal, and greater than the third. 

12.3.23 Show that an open e-neighbourhood in an ultrametric space is 
closed. 

12.3.24 Suppose that N,-(a) is an open e-neighbourhood in an ultrametric 
space and that y € N.(x). Show that N.(x) = N-(y). 


12.4 Topological properties of metric spaces 


Recall that the topology of a metric space is the collection of open subsets. 
Many, but by no means all, of the properties of a metric space (X,d) and of 
mappings from (X,d) into a metric space (Y,), can be defined in terms of 
the topologies of (X,d) and (Y,p). These are called topological properties. 
Thus 


e convergent sequences; 

e closure point, closure, closed set, dense set, separability; 
e interior, frontier or boundary; 

e limit point, isolated point, derived set, perfect set 


are all topological notions. On the other hand, the notion of an 
e-neighbourhood is not a topological one. But if we define a neighbourhood 
of a point x of a metric space (X,d) to be a set which contains N,(x) for 
some € > 0, then the notion of neighbourhood is a topological one, since N 
is a neighbourhood of x if and only if x is in the interior of N. A collection 
N of subsets of X is called a base of neighbourhoods of x if each N € N is 
a neighbourhoood of x, and if each neighbourhood of x contains an element 
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of NV. Thus the e-neighbourhoods of x form a base of neighbourhoods of 2, 
and the set {Ni/,(x) : n € N} is a countable base of neighbourhoods of 
x. Let M(x) = {y € X : d(y,x) < €}. Then M,(x) is a neighbourhood of 
x, since N(x) C M,(x). The set M(x) is closed, and is called the closed 
e-neighbourhood of x. If 0 < 7 < € then M,(x) C N.(x), so that the set 
{M.(x) : € > 0} of closed neighbourhoods is also a base of neighbourhoods 
of x. 
Continuity is also a topological property. Let us make this explicit. 


Theorem 12.4.1 Suppose that f is a mapping from a metric space (X, d) 
into a metric space (Y,p) and thatae X. 

(a) f is continuous at a if and only if whenever N is a neighbourhood of 
f(a) in Y then f~1(N) is a neighbourhood of a. 

(b) The following are equivalent. 

(i) f is continuous on X. 

(ii) If U is an open subset of Y then f—1(U) is open in X. 

(iii) If F is a closed subset of Y then f—'(F) is closed in X. 

(iv) If A is a subset of X then f(A) C f(A). 


Proof (a) Suppose that f is continuous at a and that N is a neighbourhood 
of f(a). Then there exists « > 0 such that N.(f(a)) C N. Since f is contin- 
uous at a there exists 6 > 0 such that if d(z,a) < 6 then p(f(x), f(a)) <. 
This says that Ns(a) C f—1(N-(f(a))), so that Ns(a) C f-(N), and f-1(N) 
is a neighbourhood of a. 

Conversely, suppose the condition is satisfied. If e > 0 then N.(f(a)) isa 
neighbourhood of f(a), and so f~!(N.-(f(a))) is a neighbourhood of a. Thus 
there exists 6 > 0 such that N5(a) C f~!(N.(f(a))). Being interpreted, this 
says that if d(x,a) < 6 then p(f(zx), f(a) <e. 

(b) Suppose that f is continuous on X, that U is open in Y and that 
x € f-'(U). Then f(x) € U. Since U is open, there exists € > 0 such that 
N.(f(x)) C U. Since f is continuous at x, there exists 6 > 0 such that if 
d(x',x) < 6 then p(f(z’), f(x)) < e. Thus Ng(x) C f-1(U), and so z is an 
interior point of f~'(U). Since this holds for all 2 €¢ f~!(U), f~!(U) is open: 
(i) implies (ii). 

Conversely, suppose that (ii) holds. Suppose that a € X and that N is 
a neighbourhood of f(a). Then there exists « > 0 such that N.(f(a)) C 
N. Then a € f1(N-(f(a))) C f-t(N), and f7!(N.(f(a))) is open, by 
hypothesis, so that f~'(V) is a neighbourhood of a. Thus f is continuous 
at a. Since this is true for all a € X, (ii) implies (i). 

Since a set is open if and only if its complement is closed, and since 
f-1(C(B)) = C(f7!(B)), (ii) and (iii) are equivalent. 
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Suppose that (iii) holds, and that A C X. Then f(A) is closed in Y, and 
so f-'(f(A)) is closed in X. But A C f~!(f(A)), and A is the smallest 
closed set containing A, and so A C f~1(f(A)); that is, f(A) C f(A). Thus 
(iii) implies (iv). 

Suppose that (iv) holds, and that B is closed in Y. By hypothesis, 
(FB) FMB). But {(f-(B)) © B, so that FFB) CB = B. 
Thus f(f-!(B)) C B, and so f-!(B) C f-!(B). Consequently f-1(B) = 
f-1(B): f-1(B) is closed. Thus (iv) implies (iii). 


Corollary 12.4.2 Suppose that f is continuous on X and that A is a dense 
subset of X. Then f(A) is dense in the metric subspace f(X) of (Y,p). In 
particular, if X is separable, then so is f(X). 


Proof For f(X) = f(A) C f(A). 
Corollary 12.4.3 Suppose that f is a bijective mapping f from a metric 


space (X,d) onto a metric space (Y,p). The following are equivalent: 


1. f is a homeomorphism. 

2. U is open in (X,d) if and only if f(U) is open in (Y,p). 
3. B is closed in (X,d) if and only if f(B) is closed in (Y,p). 
4. f(A) = f(A) for every subset A of X. 


Corollary 12.4.4 Suppose that f is a bijective mapping f from a metric 
space (X,d) onto a metric space (Y,p). Then f is a homeomorphism if and 
only if whenever (2n,)°2, is a sequence in X then xr, > x in (X,d) asn > co 


if and only if f(an) > f(x) as n> oo. 


There are two points to notice about this theorem and its proof. The 
first is that one needs facility at handling images and inverse images of sets. 
The second and more important point is that the conditions, in terms of 
open sets and closed sets, that we have given for a function to be continuous 
involve the inverse images of sets in Y, and not the images of sets in X. 


Exercises 


12.4.1 We have defined topological notions or properties to be those that 
can be defined in terms of the open sets of a metric space. Show that 
a notion or property is topological if it can be defined in terms of 
each of the following. 
(a) The neighbourhoods of each point. 
(b) The closed sets. 
(c) The mapping which sends each subset of X to its closure. 
(d) The mapping which sends each subset of X to its frontier. 
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Suppose that f and g are continuous mappings from a metric space 
(X,d) into a metric space (Y,). Show that the set 


{ce X: f(x) = g(@)} 


is closed in X. 
Suppose that A is a non-empty subset of a metric space (X,d). If 
x € X let d(x, A) = inf{d(z,a) : a € A}. Show that 


A={xeEX: d(z, A) =0}. 


Suppose that A and B are disjoint closed subsets of a metric space. 
If x € X let g(x) = d(x, A) + d(x, B). Show that g(x) is a continuous 
strictly positive function on X. Let h(a) = d(x, A)/g(ax). Show that 
g is a continuous function on X which satisfies 
e0<AhA(z) <1, forve xX; 

e h(x) =0, for x € A; 

e A(x) =1, for x € B. 

This is easy. Its extension to certain topological spaces is Urysohn’s 
lemma, which we shall prove later (Theorem 13.4.6); the proof is 
much harder. 


13 


Topological spaces 


13.1 Topological spaces 


The results of the previous section show that many important results con- 
cerning metric spaces depend only on the topology. We now generalize this, 
by introducing the notion of a topological space. This is traditionally defined 
in terms of open sets. A topological space is a set X, together with a 
collection 7 of subsets of X which satisfy: 


e the empty set and X are in 7; 
e if O Cr then UgeoO ET; 
e if Oy and Op» are in T then O, M Oo € T. 


Then 7 is the topology on X, and the sets in 7 are called open sets. The 
conditions say that the empty set and X are open, that the union of an 
arbitrary collection of open sets is open, and that the intersection of finitely 
many open sets is open. 

The first example of a topological space is given by taking the open sets 
of a metric space (X,d) for the topology on X; this is the metric space 
topology on X. A topological space (X,7) is said to be metrizable if there 
is a metric d on X such that 7 is the set of open sets of the metric space 
(X, d). 

Why do we make this definition? First, there are many important exam- 
ples of topological spaces in various areas of mathematics, including not only 
analysis but also logic, algebra and algebraic geometry, which are not given 
by a metric. In fact, we shall not need any of these, but it is as well to know 
that they exist. Secondly, metric spaces have a rich structure, and it is appro- 
priate to develop topological properties of metric spaces in a purely topologi- 
cal way — this helps us to appreciate the nature of these properties. Thirdly, 
there are many examples of topological spaces with weird and wonderful 


353 


354 Topological spaces 


properties, and it is entertaining to investigate them; we shall do this in 
Section 13.6. 


Starting from a topology 7 on a set X, we can immediately set up the 


machinery that has been defined for metric spaces. Suppose that « € X and 
that A is a subset of X. Here are the definitions. 


A subset N of X is a neighbourhood, or T-neighbourhood, of x if there is 
an open set O such that « € O C N. The set of neighbourhoods of 2 is 
denoted by NV. A subset B of Nz is called a base of neighbourhoods of x 
if whenever N € NV, there exists B € B with BC N. 

A subset M*(ax) of X is a punctured neighbourhood of x if there is a 
neighbourhood M of x such that M*(x) = M \ {a}. 

x is a limit point, or accumulation point, of A if M*(x) 1 A #0 for every 
punctured neighbourhood M*(z) of x. The set of limit points of A is called 
the derived set of A, and is denoted by A’. A is said to be perfect if A = A’. 
x isa closure point of Aif NAA #9 for every N € N,. The set of closure 
points of A is called the closure of A, and is denoted by A. A is said to be 
closed if A = A. A is said to be dense in X if A = X. (X,7) is separable 
if there is a countable subset C' of X which is dense in X. 

x is an isolated point of A if there exists N € N; such that NN A = {2}. If 
i(A) is the set of isolated points of A then A’/Ni(A) = @ and A = A’Ui(A). 
x is an interior point of A if A € N,. The set of interior points of A is the 
interior of A; it is denoted by A®. 

The frontier, or boundary, 0A is the set A \ A®. 

A subset 6 of a topology 7 on a set X is a base for the topology if every 
open set is the union of subsets in B. 

Suppose that (#,)°°, is a sequence in X. Then x, > x as n > oo if for 
each N € N;, there exists no € N such that x, € N for all n > no. 
Suppose that f is a mapping from A into a topological space (Y,c), that 
b is a limit point of A, and that 1 ¢ Y. Then f(x) +l asx — bin A (in 
words, f(x) tends to, or converges to, | as x tends to b in A) if whenever NV 
is a neighbourhood of | then there is a punctured neighbourhood M*() 
of b such that f(M*(x) MA) CN. 

Suppose that f is a mapping from X into a topological space (Y,o). Then 
f(x) is continuous at x if, whenever N is a neighbourhood of f(x) then 
f-1(N) is a neighbourhood of x. f is continuous on X (or, simply, is con- 
tinuous), if it is continuous at each point of X. If 7; and 72 are topologies 
on X and the identity mapping 7 : (X,71) > (X,72) is continuous, then 
we say that 7, is finer or stronger than 7, and that 72 is coarser or weaker 
than 7,. This happens if and only if T2 C 71. 
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e Suppose that f is a bijective mapping from X onto a topological space 


(Y,c). If f and f~! are both continuous, then f is called a homeomorphism 
of (X,7) onto (Y,o). 


Before investigating the use of these definitions, let us give some examples 


of topological spaces. The reader should verify that in each instance the 


conditions for being a topology are satisfied. 


1. 
2. 


If X is any set, let 7 = {0, X}. This is the indiscrete topology. 

If X is any set, let 7 = P(X), the set of all subsets of X. This is the 
discrete topology. It is the metric space topology defined by the discrete 
metric d, where d(z,y) = 1 if « # y and d(z,xz) = 0. 


. Suppose that Y is a subset of a topological space (X,7). Then ty = 


{ONY :O € rT} isa topology on Y, called the subspace topology. (Y, Ty) is 
then a topological subspace of (X,7). Topological subspaces inherit many, 
but not all, of the properties of the larger space. 


. Suppose that q is a mapping of a topological space (X,7) onto a set S. The 


collection {U CS: q7'(U) € T} of subsets of S is a topology on S, the 
quotient topology. In many cases, it is very badly behaved, and quotient 
topologies are a rich source of idiosyncracies and counterexamples. 


. If X is an infinite set, let tf be the collection of subsets of X with a finite 


complement, together with the empty set. This is the cofinite topology. 


. If X is an uncountable set, let 7, be the collection of subsets of X 


with a countable complement, together with the empty set. This is the 
cocountable topology. 


. Let 7 = {0} U {(—o0, a): aE R}U{R}. Then 7 is a one-sided topology 


on R; another is T+ = {0} U{(a,00) :a€ R} U {R}. 


. Let P denote the vector space of complex polynomials in two variables. 


If S is a subset of P, let 
Ug = {(21, 22) € C? : p(a1, 22) £0 for p € S}. 


Then it can be shown that the collection of sets {Us : S C Phisa 
topology on C?, the Zariski topology. This definition can be extended to 
other settings in algebraic geometry and in ring theory, where it is an 
important tool. (It does not have any clear use in analysis.) 


We now establish some elementary results about topological spaces. In 


many cases, the arguments are similar to those for the real line, or for metric 


spaces, and the details are left to the reader. 


Proposition 13.1.1 Suppose that B is a collection of subsets of a set X 
which satisfies 
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(i) If By, By © B then By Bg © B, and 
(ui) U{B:BeEB}=xX. 
Then there is a unique topology tT on X for which B is a base. 


Proof Let 7 be the collection of unions of sets in B. Then the empty set 
is in 7 (the union of the empty set of subsets of B) and X € 7, by (ii). 
Clearly the union of sets in 7 is in 7, and so it remains to show that finite 
intersections of sets in 7 are in T. For this, it is sufficient to show that if 
U = UcecC and V = UpepD are int (where C and D are subsets of B), then 
UNV € 7. But this holds, since UN V = Ucec,pep(C ND), which is in rT. 

It follows from the construction that 7 is unique. For if o is a topology on 
X for which B is a base, then o C 7, by the definition of a base, and tT Co, 
since the union of open sets is open. 


Let us give an example. The subsets [a,b) of R, where a < 6, satisfy the 
conditions of the proposition, and so define a topology, the right half-open 
interval topology on R. Note that 


(a,b) = Uf[(1 — Aja + Ab, b) 0 < A <1}, 


so that (a,b) is open in this topology; from this it follows that the usual 
topology on R is weaker than the right half-open interval topology. 


Proposition 13.1.2 Suppose that A and B are subsets of a topological 
space (X,d). 

(i) If ACB then ACB. 

(ii) A is closed. 

(iit) A is the smallest closed set containing A: if C is closed and ACC 
then ACC. 


Proof (i) follows trivially from the definition of closure. 
(ii) Suppose that b ¢ A. Then there exists a neighbourhood N of b such 

that NA = 0, and there exists an open set U such that b € U C N. If 

x €U then x ¢ A, since UM A = 9. Thus } is not in the closure of A. Since 

this holds for all b ¢ A, A is the closure of A and A is closed. 
(iii) By (i), ACC=C. 


Theorem 13.1.3 Suppose that (Y,Ty) is a topological subspace of a topo- 
logical space (X,T) and that ACY. Let A” denote the closure of AinY, 
and A’ the closure in X. 

(i) A =A* ny. 

(ii) A is closed in Y if and only if there exists a closed set B in X such 
that A= BY. 
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Proof (i) Certainly AY Cc Za. so that A” c A* AY. On the other hand, 
ify € A* ny and N isa T-neighbourhood of y in X, then there exists an 
open subset U in X such that ye UC N, and UN A#%. Now UNY isa 
Ty open set, y€ UNY and (UNY)NA = UNA is not empty. Thus y € A’. 

(ii) If A is closed in Y, then A = 2 =f A Y, so that we can take 
B= A“. Conversely if B is closed in X and A= BMY, then zx C B, so 
that A’ =f Ay Cenv=A, and A is closed in Y. 


Proposition 13.1.4 Suppose that A and B are subsets of a topological 
space (X,d). 

(i) If AC B then A° C BY. 

(ii) C(A®) = CCA). 

(iii) A is open if and only if C(A) is closed. 

(iv) A° is open. 

(v) A° is the largest open set contained in A: if U is open andU C A 
then U C A®. 


Proof This follows by making obvious modifications to the proof of 
Proposition 12.3.10. 


Thus, in the examples above, a subset F' of (X,7,) is closed if and only 
if it is finite, or the whole space, and a subset F' of (X,7,) is closed if and 
only if it is countable, or the whole space. 


Proposition 13.1.5 Suppose that (X,7T) is a topological space, that x € 
Xand that N, is the collection of neighbourhoods of x. Then Ny is a filter: 
that is 

(i) each N € N;, is non-empty; 

(it) if Ni, Na € N, then NiO No € Ny 

(iti) if NEN, and N CM then M EN,. 

A subset O of X is open if and only if it is a neighbourhood of each of its 
points. 


Proof (i) Since « € N, N is not empty. 

(ii) There exist open sets O; and O2 such that x € O; C N, and x € 
Og C No. Then O; M Og is open, and x € ON Og C Ny N No. 

(iii) This is trivial. 

If O is open, then it follows from the definition of neighbourhood that 
O is a neighbourhood of each of its points. Conversely, if the condition is 
satisfied, then O = O°, and so O is open. 
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The analogues of Theorem 12.2.3 and the sandwich principle also hold; 
simple modifications to the proofs are needed, and the details are left to the 
reader. 

Composition also works well; the proof is easy, but the result is of 
fundamental importance. 


Theorem 13.1.6 Suppose that f is a mapping from a topological space 
(X,7) into a topological space (Y,p) and that g is a mapping from Y into a 
topological space (Z,o). If f is continuous at a € X and g is continuous at 
f(a), then go f is continuous at a. 


Proof If N € Nof(q)) then, since g is continuous at f(a), g I(N)eé Nf (a)s 
and since f is continuous at a, f~!(g~!(N)) € Ng. The result follows, since 


(gof) (NK) =f tg "W)). 
The next result corresponds to Theorem 12.4.1. We give some details of 
the proof, though the proof is essentially the same. 


Theorem 13.1.7 Suppose that f is a mapping from a topological space 
(X,7) into a topological space (Y,o0). The following are equivalent. 

(i) f is continuous on X. 

(ii) If U is an open subset of Y then f-1(U) is open in X. 

(iti) If F is a closed subset of Y then f—'(F) is closed in X. 

(iv) If A is a subset of X then f(A) C f(A). 


Proof Suppose that f is continuous, that U is open in Y and that a € 
f~'(U). Then f(x) € U, andU € Nj). Since f is continuous at x, f~'(U) € 
N,, and so 2 is an interior point of f~'(U). Since this holds for all 2 € 
f-'(U), f-+(U) is open: (i) implies (ii). 

Conversely, suppose that (ii) holds. Suppose that a € X and that N is 
a neighbourhood of f(a). Then there exists an open set U in Y such that 
f(a) €U CN. Thena € f-'(U) C f71(N), and f-!(U) is open, by 
hypothesis, so that f~!(V) is a neighbourhood of a. Thus f is continuous 
at a. Since this is true for all a € X, (ii) implies (i). 

Since a set is open if and only if its complement is closed, and since 
f-1(C(B)) = C(f7!(B)), (ii) and (iii) are equivalent. 

The proof of the equivalence of (iii) and (iv) is exactly the same as the 
proof in Theorem 12.4.1. 


Corollary 13.1.8 Suppose that f is continuous and that A is a dense 
subset of X. Then f(A) is dense in the topological subspace f(X) of (Y,p). 
In particular, if X is separable, then so is f(X). 
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Proof This follows from condition (iv). 


Corollary 13.1.9 Suppose that f is a bijective mapping f from a topo- 
logical space (X,d) onto a topological space (Y,p). The following are 
equivalent: 

(i) f is a homeomorphism. 

(ii) U is open in (X,d) if and only if f(U) is open in (Y, p). 

(iii) B is closed in (X,d) if and only if f(B) is closed in (Y, p). 

(iv) f(A) = f(A) for every subset A of X. 

(uv) f(A°) = (f(A))° for every subset A of X. 


What about sequences? We shall see that there are some positive results, 
but that, in general, sequences are inadequate for the definition of topological 
properties. 


Proposition 13.1.10 Suppose that A is a subset of a topological space 
(X,7) and that be X. 

(i) If there exists a sequence (a;)72, in A\ {b} such that aj — b as j — 00 
then b is a limit point of A. 

(ii) If there exists a sequence (aj) Foy in A such that aj > b as j > o, 
then b is a closure point of A. 

The converses of (i) and (it) are false. 


Proof (i) If M*(b) is a punctured neighbourhood of b then there exists 
jo € N such that a; € M*(b) for j > jo. Thus AN M*(b) £0, so that bisa 
limit point of A. 

The proof of (ii) is exactly similar. 

We shall use the same example to show that the converses do not hold. 
Suppose that X is an uncountable set, with the cocountable topology 7, 
described above. Suppose that A is any uncountable proper subset of X. If 
x € X and if M*(x) is any punctured neighbourhood of X, then AN M*(z) 
is non-empty, so that A’ = A = X: every point of X is a limit point, and 
therefore a closure point, of A. Suppose that a; € A and that a; — b as 
j ~ o. Then N = X \ {aj : aj ¥ 6} is a neighbourhood of b, and so there 
exists jo such that a; € N for 7 > jo. Thus a; = b for 7 > jo: the sequence 
(a;)72, is eventually constant and b € A. Thus if c ¢ A there is no sequence 
in A which converges to c. 


Proposition 13.1.11 Suppose that (X,r) and (Y,c) are topological 


spaces, that (a;)Fo4 is a sequence in X which converges to a as 7 + co and 
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that f is a mapping from X to Y. If f is continuous ata then f(a;) > f(a) 
as 7 + oo. The converse is false. 


Proof If N € N(f(a)) then f-1(N) € Ng. There exists jo € N such that 
a; € f-1(N) for 7 > jo. Thus f(a;) € N for j > jo: f(aj) > f(a) as j > oo. 

Let X be an uncountable set, let 7, be the cocountable topology on X 
and let 7 be the discrete topology. The identity mapping i from (X,7,) into 
(X,7) has no points of continuity, but a sequence converges in (X,7,) if and 
only if it is eventually constant, in which case it converges in the discrete 
topology. 


Exercises 


13.1.1 Suppose that f is a mapping from a topological space (X,T) to a 
topological space (Y,c), and that A and B are two closed subsets of 
X whose union is X. Show that f is continuous on X if and only 
if its restriction to A and its restriction to B are continuous. Is the 
same true if ‘closed’ is replaced by ‘open’? What if A is open and B 
is closed? 

13.1.2 Give an example of a topology on N with the property that every 
non-empty proper subset of N is either open or closed, but not both. 

13.1.3 Let X be an infinite set, with the cofinite topology tr. What are 
the convergent sequences? Show that a convergent sequence either 
converges to one point of X or to every point of X. 

13.1.4 Give an example of a continuous mapping f from a metric space 
(X,d) to a metric space (Y, p) and a subset A of X for which f(A°) Z 
(f(A))°, and an example for which f(A°) Z (f(A))?°. 

13.1.5 Suppose that (X,7) is a topological space and that (Y,d) is a metric 
space. Let C,(X,Y) be the space of bounded continuous mappings 
from X into Y. Show that C,(X,Y) is a closed subset of the space 
(By (X), doo) of all bounded mappings from X to Y, with the uniform 
metric. 

13.1.6 Verify that the collection of subsets {(—oo, a): a € R} of R, together 

with 9 and R, is a topology 7_ on R. 
A real-valued function on a topological space (X, 7) is said to be upper 
semi-continuous at x if, given € > 0 there exists a neighbourhood N 
of x such that f(y) < f(x) +e for y € N. Show that f is upper 
semi-continuous at x if and only if the mapping f : (X,7) > (R,7_) 
is continuous at 2x. 
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13.2 The product topology 


Suppose that {(Xa,Ta)}aea is a family of topological spaces. Is there a 
sensible way of defining a topology on X = [[,<4 Xa? In order to see how 
to answer this question, let us consider a simple example. The space R4 
is the product of @ copies of R. Mi # = (21,...,2%q), let aj(2) = vy, for 


1 


<j <d. The mapping 7; : R? > R is the jth coordinate projection. If we 


give R@ and R their usual topologies, we notice four phenomena. 


e Since |7;(x) — 7;(y)| < d(z,y) (where d is the Euclidean metric on R%), 


each of the mappings 7 is continuous. 


e For 1 < j < d let us denote the set {1,...,d} \ {7} by d\ {j}. Suppose 


that y € R4\U. If x E R let kyj > R- R? be defined by 


(ky,j(@))j = @ and (Ky,j(@))i = yi for 1 € d\ {7}. 
Let 
Cyj ={e € R41: a; = y; forie d\ {i}; 


Cy; is called the cross-section of R? at y and the mapping ky; the cross- 
section mapping. Then the mapping ky; is an isometry of R onto C,,;. 

Suppose that f : (X,7) + Ris a mapping from a topological space (X, 7) 
into R¢. We can then write f(x) = (fi(x),..., fa(x)), where f; = 1; 0 f. 
If f is continuous, then the composition f; = 7; 0 f is continuous. But the 
converse also holds. Suppose that each of the mappings f; is continuous, 
that « € X and e > 0. Then for each 7 there exists a neighbourhood N; 
of a for which | f;(y) — f;(x)| < «/Vd for y € N;. Then N = ng_,N; isa 


be 


x 


I 
I 
Be ee a : 
kyo C 5 
x,2 
a CT 
I 
I 
I 


Figure 13.2. The cross-section C,z.2. 
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neighbourhood of x, and if y € N then 


d 
d( f(y), f(x) =| S lf -f@P] <«6 
j=l 


so that f is continuous at x. 

e On the other hand, suppose that f : R¢ — (Y,c) is a mapping from R? to 
a topological space (Y,o), that x € R?@ and that 1 < j < d. Let Dy) =a 
for i € d\ {j}, so that #; € R‘\t3}. If f is continuous, then the mapping 
fo kz, :R — (Y,c) is continuous, for 1 < j < n, but the converse need 
not be true. For example, the real-valued function f on R? defined by 
f(0,0) = 0 and f(z, y) = ry/(x? + y”) is continuous at every point of R? 
except (0,0), but the mappings x > f(z,y’): R > Rand y > f(2’,y) 
are continuous, for all x’ and y’. We need to distinguish these phenomena 
carefully. We say that f : R¢ — (Y,c) is jointly continuous at x if it is 
continuous, and say that it is separately continuous at x if the mapping 
f okg,,j; from R to (Y,o) is continuous at 2;, for 1 <j <d. 


We use these observations to motivate the definition of the product topol- 
ogy on X = [],¢4 Xa. We want to define a topology 7 on X for which each 
of the coordinate mappings 7, : (X,T) > (Xa,T) is continuous. Thus we 
require that if Ug is open in X,q then 


ti (Ug)=Uax |] Xe 
Bxa 


is in T. Since finite intersections of open sets are open, we also require that if 
F is a finite subset of the index set A and Ug is open in Xq, for each a € F, 


then 
() ra'Ua)=[[ Vax [] Xp 


ack aeF BEA\F 


is in 7. We can take these as a basis for the topology we need. 


Theorem 13.2.1 Suppose that {(Xa,Ta)}aea is a family of topological 
spaces, and that X = [[,ce,Xa- Let B be the collection of sets of the 
form (\eer nm, (Ux), where F is a finite subset of A, Tq is the coordinate 
projection of X onto Xo and Uy is open in (Xa,Ta)- 

(i) B is a base for a topology T on X. 

(it) Each of the coordinate projections Ta. : (X,T) > (Xa,Ta) is 
continuous. 
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(iit) If (Y,o0) is a topological space and f : (Y,0) > (X,T) is a mapping, 
then f is continuous if and only if each of the mappings 7, 0° f : (Y,0) > 
(Xa; Ta) 18 continuous. 

Further r is the unique topology on X for which (ti) and (iii) hold. 


Proof (i) Since X is the empty intersection of sets of the form 7z!(Ua) 


X € B. Suppose that U = Nacr7,1(Ua) and V = Ngeatg (Va) are in B. 
Then 


UNV = (Neer\eta (Ua) 9 (Neerneta (Ua Va)) A (Nsca\rng '(Va)) 


which is in B. Thus B is the base for a topology on X, by Proposition 13.1.1. 

(ii) is a consequence of the definition of B (take F = {a}), and 
shows that the condition in (iii) is necessary. On the other hand, 
suppose that the conditions are satisfied. If y © Y then the sets 
{B € B: f(y) € B} form a base of 7-neighbourhoods of f(y). If y «© Y 
and N =Naert,!(Ua) is such a neighbourhood then 


f'(N) = Maer f 7" (ta (Ua) = Maer (Ta? f)* (Ua): 


Since 7, © f is continuous, (7 0 f)~!(U) is a neighbourhood of y, and so 
therefore is f~'(N). Thus f is continuous. 

If o is a topology on X for which (ii) and (iii) hold, then the sets in 
B must be in o, since the coordinate mappings are continuous, and so 
7 Co. On the other hand, consider the identity mapping i : (X,7) > (X,¢). 
Each of the mappings 7, 07 : (X,T) > (Xq,7T) is continuous, and so i is 
continuous, by hypothesis. Thus if U € o then i-'(U) = U € 7, and so 
o CT. 


The topology 7 of this theorem is called the product topology on X = 


Hae Xq. 
Similar remarks apply about the need to distinguish between joint 


continuity and separate continuity. 
We also have a result concerning cross-sections. Suppose that 8 € A and 
that y € [] 4,1} Xa. Let ky,g : Xg + X be defined by 


(kale) jg =2 and (hy e(2))a = ve loro eA \ (B}. 


Let 
Cu pg= ie =X tae ye tore AN ey 


Cy, is called the cross-section of X at y and ky g the cross-section mapping. 
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Corollary 13.2.2 If 8 © A andy € [Laygey Xq, the cross-section map- 
ping ky, is a homeomorphism of (Xg,T3) onto Cyg, when Cy g is given the 
subspace topology. 


Proof It follows from the definition of 7 that ky g is continuous. On the 


other hand, koe is the restriction of mg to Cy g. 


In particular, if ((X1,71),..-,(Xn,7)) is a finite product of topological 
spaces, then the collection of sets U; x --- x Un, where U; € 7; for l <j <n, 
is a base for the product topology on ee Xj. 

Similarly, if A = N, so that we have a product of an infinite sequence of 
topological spaces, then the collection of sets Uy x --- x Un x 1 beeen XG, 
where n € N and U; € 7; for 1 <7 <n, is a base for the product topology 
on jen Xj. 

Suppose that there is a topological space (X,7) such that (Xq,T.) = 
(X,7) for each a in A. In this case, we can identify the product [[,¢4 Xa 
with the space X4 of all functions from A into X. In this case, the product 
topology is referred to as the topology of pointwise convergence. Suppose 
that b is a limit point of a subset C of a topological space (Y,o), that f is a 
mapping of C into X4, and that | € X4. Ifc € C, then f(c) is a function on 
A taking values in X, which we denote by f., and / is also a function on A 
taking values in X. Then f, > /, in the topology of pointwise convergence, 
as c + b if and only if f.(a) > l(a) in X as c—> b, for each a € A. 

The product topology is a very weak topology. Consider the vector space 
R-4) of all real-valued functions on [0,1] with the topology of pointwise 
convergence. This is a big set — it is bigger than R. Nevertheless, it is sep- 
arable: let us show that the countable set of all polynomials with rational 
coefficients is a dense subset. Suppose that f €¢ Rl] and that N is a neigh- 
bourhood of f in the product topology. Then there exists a finite subset F’ 
of [0,1] and € > 0 such that 


{gE RO: g(t) — f(t] <eforte F} CN. 


There exists a real polynomial p of degree |F'| — 1 such that p(t) = f(t) 
for t € F’, and there exists a polynomial q with rational coefficients, of the 
same degree, such that |q(t) — p(t)| < € for t € F. Thus g € N, so that the 
countable set of all polynomials with rational coefficients is a dense subset 
of R11), 

Here is another example. Give the two-point set {0,1} the discrete topol- 
ogy. Recall that a mapping f from a set X into {0,1} is called an indicator 
function. We write f = I4, where A= {x: f(x) = 1}, and call I, the indi- 
cator function of A. The space {0,1}* of all indicator functions on X, with 
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the product topology, is called the Bernoulli space of X, and is denoted by 
Q(X). The space Q(N), whose elements are sequences, taking values 0 or 1, 
is called the Bernoulli sequence space. The mapping I : A > I, is a bijection 
of the power set P(X) onto Q(X). We can therefore define a topology on 
P(X) by taking the collection {I~!(U) : U open in Q(X)} as the topology 
on P(X); we call this topology the Bernoulli topology. If A is a subset of X, 
and F is a finite subset of X, let Nr(A) ={B CX: BNF =ANF}. Then 
the collection {Np(A) : F a finite subset of X} forms a base of open neigh- 
bourhoods for A in the Bernoulli topology. Some properties of the Bernoulli 
space (2({0, 1]) are investigated in the exercises. 


Exercises 


13.2.1 Suppose that (Xq,7,) and (X2,72) are topological spaces and that 
A, C X; and Ag C Xo. Show that if X, x X92 is given the product 
topology then 


O(A, x Ag) = (OA, x Ag) U (Ay x OAg2). 


13.2.2 Suppose that {(Xq, Ta) }aea is a family of topological spaces, and that 
A is the disjoint union of non-empty sets A; and Aj. Show that the 
natural mapping from X = [[,¢4 Xa onto [[ye4, Xa X Tea, Xa 
is a homeomorphism, when each of the spaces is given its product 
topology. 

13.2.3 Define a partial order on the topologies on a set X by saying that 
T < 72 if | C 7%). Suppose that 7, and 72 are topologies on X. Let 
6: X — X x X be the diagonal mapping defined as d(x) = (2,2). 
Let 7, X T2 be the product topology on X x X, and let o = {6-!(U) : 
U € 7 X 72}. Show that o is a topology on X, and that it is the least 
upper bound of 7, and 7). Show that 7, and 72 have a greatest lower 
bound, and determine its elements. Thus the topologies on X form a 
lattice. 

13.2.4 Give P([0,1]) the Bernoulli topology. 

(i) Show that P([0,1]) is separable. (Hint: consider step functions.) 
(ii) Let A,([0,1]) be the collection of subsets of [0,1] with exactly 
n elements and let B,([0,1]) be the collection of subsets of 
(0, 1] with at most n elements. Show that A,,([0,1]) is a discrete 
subspace of P((0, 1]). 
(iii) Show that B,,([0, 1]) is the closure of A,,([0, 1]). 
(iv) Show that B,,({0,1]) is not separable. 
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13.3 Product metrics 


What can we say about products of metric spaces? First we consider finite 
products. 


Proposition 13.3.1 Suppose that ((Xj,dj))?_, is a finite sequence of met- 
ric spaces. Then there is a metric d on X = es X; such that the metric 
space topology defined by d is the same as the product topology. Further, we 
can choose d so that each of the cross-section mappings ky; is an isome- 
try, and so that the coordinate mappings 7; are Lipschitz mappings, with 
constant 1. 


Proof There are many ways of doing this. The easiest is to consider a 
norm ||.|| on R” with the properties that if 0 < a; < b; for 1 <j <n then 
I|(a1,---,@n)|| < |](01,---,6n)||, and that |le;|| = 1 for each unit vector e; in 
R” - we could, for example, take 


lel, = jaa leyl, or 
el = Welle = (CFea les?)"?, or 
l2llse = max{|2y|2 17 <a}. 


We then set d(z,y) = ||(di (1, y1),---,dn(@n; Yn))||. Then d is a metric on 
X. Clearly, d(x, y) = d(y,x) and d(z,y) = 0 if and only if x = y; since 


dn, 2) = ||\(d ey 21) sexta ins Sn) )|| 


= I (da (1, y1) ate di(yi, ZA); see »An( Ens Yn) =F dn (Yn; Zn))|| 
S (di (71, 91), «+ An(@ns Yn)) I] + [dr 1, 21), «++ In(Yns Zn) | 
= d(x,y) + dy, 2), 


the triangle inequality holds. If y € nyt X;, and z,2’ € X;, then 
A(kyj (x), kyj(x')) = ||dj(x,x')e;|| = dj(z,2"), so that the cross-section 
mapping k, ; is an isometry. 

Ifl<j<nanda,ye€X then 


dj (75 (x), 75 (y)) = lla; (77 (x), 73 (y) ejll < a(x, y), 


so that the coordinate mappings are Lipschitz mappings with constant 1. 
Finally, suppose that f is a mapping from a topological space (Y,o) into X 
for which each of the mappings f; = 7; 0 f is continuous, that y ¢ Y and 
that « > 0. For each 7 there exists a neighbourhood Nj; of y such that if 
z € Nj then d;(fj(z), fi(y)) < €/n. Let N =N7_,Nj; N is a neighbourhood 
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of yin Y. If z € N then 


= Ilda (f(z), Ay) eal 5 tiie Ilan (fn (2); fr(y))enll <6, 


so that f is continuous. 


When (X;,d;) is R, with its usual metric, the metric d is simply the 
metric given by the norm ||.||. 

A metric which satisfies the conclusions of this proposition is called a 
product metric. Suppose that each of the spaces is a normed space, and 
that the product metric is given by a norm. Then the norm is called a 
product norm. For example, the Euclidean norm is a product norm on R? = 
ene d;), where (X;,d;) is R, with its usual metric, for 1 <j <d. 

Here is another example. 


Proposition 13.3.2 Suppose that (X,d) is a metric space. Then the real- 
valued function (x,y) + d(a,y) on (X,d) x (X,d) is a Lipschitz mapping 
with constant 2 when (X,d) x (X,d) is given a product metric p. 


Proof By Lemma 11.1.14, |d(x, y)—d(2’, y’)| < d(x, x')+d(y, y’); the result 
follows since 


d(x, a") < p((x,y), (2’,y’)) and d(y,y’) < p((a,y), (2, y')). 


What is more interesting is that a result similar to Proposition 13.3.1 
holds for countable infinite products of metric spaces. 


Theorem 13.3.3 Suppose that ((X5, 45 )) Fa is a countably infinite 
sequence of metric spaces. Then there is a metric p on X = jen X; such 
that the metric space topology defined by p is the same as the product topol- 
ogy. Further, we can choose p so that the cross-section mappings ky; are 
Lipschitz mappings, with constant 1. 


Proof We need a preliminary result, of interest in its own right. 


Lemma 13.3.4 Suppose that ¢ is a continuous increasing real-valued func- 
tion on [0,co) for which o(t) = 0 if and only if t = 0, and for which the 
function w on (0,00) defined by W(t) = G(t)/t is decreasing. If (X,d) is a 
metric space, then the function p defined by p(x,y) = o(d(x,y)) is a met- 
ric on X equivalent to d. If w(t) < K for all t € (0,00) then the identity 
mapping (X,d) + (X,p) is a Lipschitz mapping with constant K. 
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Proof First we show that p is a metric. Certainly, p(x, y) = 0 if and only 
if c = y, and p(x, y) = p(y, x). Note that if a > 0 and 6 > 0 then 


b(a +b) = (a + b)b(a + b) < ayp(a) + by(b) = G(a) + O(0). 
Now suppose that x,y,z € X. Then 
p(x, z) = d(d(x, z)) < (d(x, y) + dly, 2) 
< $(d(z,y)) + (aly, z)) = o(z,y) + ply, 2)- 


Thus the triangle inequality holds. 

Suppose that «€ > 0. Since ¢ is continuous at 0, there exists 6 > 0 such 
that if 0 <t <6 then 0 < ¢(t) < «. Thus if d(z,y) < 6 then p(z,y) < €, and 
the identity mapping 7 : (X,d) — (X,:) is continuous. Conversely, if 7 > 0 
and 0 < d(t) < o(n) then 0 < t < 7. Thus if p(z,y) = o(d(z,y)) < ¢(n) 
then d(x,y) <7 and the identity mapping 7: (X,p) > (X,d) is continuous. 

If ~ is bounded by K, then ¢(t) < Kt and so p(x,y) < Kd(z,y). 


In particular, if @ is bounded then p is a bounded metric on X. Popular 
functions with this property are ¢-(t) = t/(1+ ct), and ¢,(t) = min(c,t), 
where c > 0; 0 < ¢-(t) < c. In each case the corresponding function ~, is 
bounded by 1, so that the Lipschitz constant can be taken to be 1. 


We now return to the proof of Theorem 13.3.3. Again, there are 
yl 
sequence of positive numbers for which ya cj; < oo (for example, take 


many ways of defining a suitable metric. For example, let (c;) be a 
c; = 1/2). For each j let p; be a metric on Xj; which is equiva- 
lent to d;, which is bounded by c;, and for which the identity mapping 
(X;,d;) + (Xj, p;) is a Lipschitz mapping, with Lipschitz constant 1. Define 
a real-valued function p on X x X by setting 


pau) = S— pj (a50)). 
j=l 


The conditions that we have imposed show that this sum is finite. First, we 
show that p is a metric on X. Clearly, d(x,y) = d(y,x), and d(z,y) = 0 if 
and only if «= y. If z,y,z © X then 


Co co 
p(a,z) = S— ajay, 25) < S_ (05 (m5, 49) + 25 (47,25) 
j=l j=l 
CO CO 
= S> pj(05,4;) + © py (uss 2) = (zy) + oy, 2), 
j=l j=l 


so that the triangle inequality holds. 


13.8 Product metrics 369 


Next, we show that the metric topology on X defined by p is the product 
topology; we show that it satisfies the conditions of Theorem 13.2.1. If 7 ¢ N 
and x,y € X then p;(7;(x),7;(y)) < p(x, y), so that 7; is continuous, and 
(ii) is satisfied. Suppose that f is a mapping from a topological space (Y,c) 
into (X,p), and let f; = 1;0 f. If f is continuous, then, since each mapping 
7; is continuous, each of the mappings f; is continuous. On the other hand, 
suppose that each of the mappings f; is continuous. Suppose that y € Y 
and that « > 0. We must show that f~'(N.(f(y))) is a neighbourhood 
of y. There exists jg such that paar co = ef2. For 1 = 7 <= jp, tet 
U; = {ax € X;: p;(x, fj(y)) < €/2jo}; U; is an open neighbourhood of f;(y) 
in X;. Let U = Mieinte (ey) Since each of the mappings 7; is continuous, 
U is an open neighbourhood of y in (Y,o). If z € U then 


Jo lee) 
p(f(2), f(Y) => 5 (F265) + SS o,(Fi(2), f(y) 
j=1 j=jot1 


< Yet 3 cj <6, 


J=Jjo+1 


so that U C f-1(N-(f(y))), and f-!(N-(f(y))) is a neighbourhood of y. 
Finally, it follows from the construction that each of the cross-section 


mappings k, ; is a Lipschitz mapping, with constant 1. 


A metric which satisfies the conditions of this theorem is called an infinite 
product metric, or, simply, a product metric. 
As examples, the metrics 


ra | : . 
pny) =p ee and o(z, y) = Lmintt Cale) 


are frequently used. 

Let us give three examples of countable products of metric spaces. We 
can give the vector space RN of all real sequences the topology of pointwise 
convergence. This is metrizable, and a suitable product metric is 


[o@) 
|zj — yj 
de.y) = re 
yy 2I(1 + |x; — yl) 


This is separable, since the countable set of sequences with rational terms, 
all but finitely many of which are zero, is dense in RN 

Next, let H = [0,1JN = ja I;, where I; = [0,1] for 7 € N, with 
the product topology. Then H is a closed subset of RN. There are many 
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product metrics which define the product topology on H. One such metric 
is p(x,y) = (524 lay — yj|?/j?)'/?. The mapping m : H — ly defined by 
m(x) = (x%;/7)F<, is then an isometry of H onto the subset H2 of the inner 
product space ly defined as 


fe={re bi 0s 7 < 1/9 mr il< 7 = oo}. 


Hy is called the Hilbert cube;. 

Thirdly, the Bernoulli sequence space Q(N) can be considered as a closed 
subset of H. One product metric which defines the topology on Q(N) is 
given by setting d(a,y) = 2 ee |x; — y;|/3’. With this metric, if c € Q(N) 
then 

No/3i = {y € QUIN) : ys = a; for 1 <7 < J}. 


If z € Q(N), let s(x) = 25052, z;/3). Then s is an isometry of (Q(N),6) 
onto Cantor’s ternary set. As we have seen, 2(N) can be identified with 
P(N), with the product topology, and so P(N), with the product topology, 
is homeomorphic to Cantor’s ternary set. 


Exercises 


13.3.1 Suppose that (Seat) is a finite sequence of metric spaces. If 
r= (x,)4 and y= (,)_{ are in X = [fy A; let 
J 
pi(z,y) = 2 (aw) and poo(r,y) = ee di (xi, yi). 
— 
Show that p; and po are product metrics on X and that if p is 
any product metric on X, then po < p < p,. Deduce that any two 
product metrics on X are Lipschitz equivalent. 


13.3.2 Let w = RN be the vector space of all real sequences. Show that 
there is no norm on w which defines the product topology on w. 


13.4 Separation properties 


If 7 is the metric space topology of a metric space (X,d) then 7 has certain 
properties which other topologies do not possess. In this section and the 
next, we shall introduce some of these properties. 

If X is a set with the indiscrete topology, it is not possible to distinguish 
between points topologically. In order to be able to do so, it is necessary to 
introduce separation properties. We shall introduce five of them. 
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e A topological space (X,7) is a T; space if each singleton set is closed. 

e A topological space (X,7) is a Hausdorff space if whenever x and y are 
distinct points of X there exist disjoint open sets U and V such that 
zrEU,yeV. 

e A topological space (X,7) is regular if it is Hausdorff and whenever A is a 
closed subset of X and x is an element of X which is not in A, then there 
exist disjoint open sets U and V such that r © U, ACV. 

e A topological space (X,7) is completely regular if it is Hausdorff and when- 
ever A is a closed subset of X and x is an element of X which is not in A 
then there exists a continuous function f : X — [0,1] such that f(x) = 0 
and f(a) =1 fora€ A. 

e A topological space (X,7) is normal if it is Hausdorff and whenever A 
and B are disjoint closed subsets of X then there exist disjoint open sets 
U and V such that ACU, BCYV. 


Unfortunately, terminology varies from author to author; the issue is whether 
or not the Hausdorff condition should be included in the last three defini- 
tions. It is therefore sensible to be cautious, and, for example, to refer to 
a ‘regular Hausdorff space’. As we shall see, the conditions are listed in 
increasing order of restrictiveness. 

It follows from the definition that a topological space (X,7) is a T, space 
if whenever x and y are distinct points of X there exists an open set U such 
that y € U and « ¢ U, so that a Hausdorff space is a T) space. 


Proposition 13.4.1 In a Hausdorff space, limits are unique. Suppose that 
f is a continuous mapping from a topological space (Y,o) into a Hausdorff 
topological space (X,7T), that b is a limit point of X and that f(x) > | as 
z—b. Then | is unique. 


Proof If f(a) > mas x > b, and 1 4 m then there exist disjoint open 
sets U and V in X such that 1 © U and m € V. But then there exist 
punctured neighbourhoods N7;(b) and Ny(b) of 6 such that f(NG(b)) C U 
and f(N;,(b)) C V. But this implies that Nj; Ny = 9, contradicting the 
fact that b is a limit point of Y. 


In fact, the condition is also necessary (Exercise 13.4.1). 


Proposition 13.4.2 A topological space (X,T) is regular if and only if 
it is Hausdorff, and each point has a base of neighbourhoods consisting of 
closed sets. 


Proof Suppose that (X,7) is regular, that  € X, and that N € N,. 
There exists an open set O such that « € O C N. Then X \O is closed, and 
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x ¢ X \ O, and so there exist disjoint open sets U and V with x € U and 
X\OCV. Then 

rEUCX\VCOCN, 
so that X \V is a closed neighbourhood of x contained in N. Thus the closed 
neighbourhoods of x form a base of neighbourhoods of x. The proof of the 


converse is left as an exercise (Exercise 13.4.2). 


Proposition 13.4.3 A completely regular topological space (X,T) is 
regular. 


Proof Suppose that A is a closed subset of X and that « ¢ A. Then there 
exists a continuous mapping of X into [0,1] with f(z) =0 and f(a) = 1 for 
a€é A. Let 


U={yeX: fly) <1/2}, V={ye X: f(y) > 1/2}. 


Then U and V are disjoint open subsets of X, andz<€U, ACV. 


Complete regularity has a different character to the other separation con- 
ditions, since it involves real-valued functions; but a great deal of analysis 
is concerned with continuous real-valued functions on a topological space. 


Proposition 13.4.4 A topological space (X,T) is completely regular if and 
only if whenever x € X, By is a base of neighbourhoods of x and B € By, 
then there exists a continuous function f : X — [0,1] such that f(x) = 0 
and f(y) =1 fory ZB. 


Proof Suppose that (X,7) is completely regular and that B is a basic 
neighbourhood of a point « € X. Then « € B° C B, and X \ B° is closed, 
and so there exists a continuous function f : X — [0,1] such that f(x) = 0 
and f(y) =1 for y ¢ B°. Thus f(y) = 1 for y ¢ B. 

Conversely, suppose that the conditions are satisfied, that « € X, that 
A is a closed subset of X and that « ¢ A. Then there exists B € B, such 
that BO A= 0. There exists a continuous function f : X — [0,1] such that 
f(z) =0 and f(y) =1 for y ¢g B. Then f(a) =1 forae A. 


It is easy to verify that a topological subspace of a T, space (Hausdorff 
space, regular space, completely regular space) is a T, space (Hausdorff 
space, regular space, completely regular space), and that a closed subspace 
of a normal space is normal. As we shall see (Example 13.6.9), not every 
subspace of a normal space is normal. 


Proposition 13.4.5 If {(Xa,Ta):a@€ A} is a family of T, space (Haus- 
dorff spaces, regular spaces, completely regular spaces) then the product 
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X = [[gca Xa is a Ty space (Hausdorff space, regular space, completely 
regular space) when it is given the product topology T. 


Proof Suppose x and y are distinct elements of X. There exists a € A 
such that tq # Ya. If (Xa,Ta) is T; then there exists Ua € Tq such that 
Lq € Ug and Ya ¢ Ug. Then U = a, (Us) is open in X anda eU,y ¢U, 
so that X is T;. An exactly similar argument shows that X is Hausdorff if 
each of the spaces (Xq, Tq) is Hausdorff. 

Suppose next that each space (Xq,7q) is regular. If x € X then each 
Lq has a base Bz. of neighbourhoods consisting of closed sets. Then the 
collection 


{Naerta*(Na(ta)) : F finite, No(te) € Bea} 


of subsets of X is a base of neighbourhoods of x consisting of closed sets. 
Thus (X,7) is regular. 

Finally, suppose that each (Xq,T7q.) is completely regular, and that 
NacrTa '(No(2x)) is a basic neighbourhood of «. For each a € F there exists 
a continuous mapping fa : Xq — [0,1] such that fo(%q) = 0 and fa(ya) = 1 
for ya ¢ Na(taq). Set f(x) = maxeaer fa(mta(x)). Since each mapping fy oa 
is continuous on (X,7), the function f is a continuous function on (X,7). 
Further, f(z) = 0, and f(y) = 1 if y € Neert,'(No(z)). Thus (X,7) is 
completely regular. 


We shall see (Example 13.6.11) that the product of two normal spaces 
need not be normal. 

A normal space is completely regular; this follows immediately from the 
principal result of this section. 


Theorem 13.4.6 (Urysohn’s lemma) If A and B are disjoint closed sub- 
sets of a normal topological space (X,T) then there exists a continuous 
mapping f : X —> [0,1] such that f(a) = 0 fora € A and f(b) = 1 for 
be B. 


Proof Let D be the set of dyadic rational numbers in [0,1] - numbers 
of the form p/2” with p and n in Zt, and p < 2”. Using the axiom of 
dependent choice, we define a family {Uq : d € D} of open subsets of X 
with the properties that 


e if dy < dy then Ug, C Ug,, and 
e ACU) and U, =X |B. 


We begin by setting U; = X \ B. Since A and B are disjoint closed sets, 
there exist disjoint open subsets V and W such that A C V and B CW. 
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Then, since X \ W is closed, 
ACVCVCX\WCX\B=N, 


and so we can take Up = V. 

We now re-iterate this argument. Suppose that we have defined Uj for all 
d which can be written in the form d = p/2™, with m < n. Suppose that 
d = (2k +1)/2"*1. Let 1 = k/2” and let r = (k + 1)/2". Then] <d <r, 
and U; and U, have been defined. Since U; and X \ U, are disjoint closed 
sets, there exist disjoint open subsets V and W such that U; C V and 
X \U, CW. Then, since X \ W is closed, 


U,CVCVCX\WCU,. 


We can therefore take Ug = V. 

We now use this family to define the function f. If  € B, we set f(x) =1 
and if c € X \ B we set f(x) = inf{d € D: « € Ug}. Then 0 < f(z) <1, 
f(b) = 1 for b € B, and, since A C Up, f(a) = 0 for a € A. It remains to 
show that f is continuous. For this we use the fact that D is dense in [0, 1]. 

First, suppose that 0 < a < 1. Let Vy = U{Ug : d < a}. Va is open; we 
shall show that Va = {x € X : f(x) < a}. If f(x) < a there exists d € D 
with f(z) <d <a, and so x € Uy C Vg. Thus {x € X: f(x) < a} C Va. 
On the other hand, if x € V, then x € Ug for some d < a, and so f(x) <a. 
Thus Vy C {a € X : f(x) < a}. Consequently, Va = {a € X : f(x) < a}. 

Next, suppose that 0 < B < 1. Let Wg = U{X \Ua:d > B}. Wg is 
open; we shall show that Wg = {x ¢ X : f(x) > GB}. If f(x) > 6 there 
exists d € D with 6 < d < f(x), and so x ¢ Ug. Thus x € Wa, and so 
{x € X : f(x) > B} C Wg. On the other hand, if z € Wg then x € X \Uq 
for some d > £. There exists e € D with d < e < 8. Then z € X \ Uz, so 
that f(z) >e > d> B. Thus Wg C {x € X : f(x) > B}. Consequently, 
Wg={xeEX: f(x) > B} 

Thus if0< 6<a<1 then 


{Perk tp =< fia) <ah=WeliVe 


is an open set; from this it follows that f is continuous. 


Note that Exercise 12.4.4 shows that this theorem holds for metric spaces. 
If A and B are closed disjoint subsets of a metric space (X,d), and f isa 
continuous mapping of X into [0,1] for which f(a) = 0 for a € A and f(b) = 
1forbeé B,thnU={xeEX: f(x) < 1/2} andV={xeEX: f(x) > 1/2} 
are disjoint open sets which separate A and B. Consequently, a metric space 
is normal. 
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Exercises 


13.4.1 Suppose that (X,7T) is a topological space with the property that 
whenever f is a continuous mapping from a topological space (Y,c) 
into (X,7), 6 is a limit point of Y, and f(x) > 1 as x > b, then / is 
unique. Show that (X,7) is Hausdorff. 

13.4.2 Suppose that every point x in a Hausdorff topological space has a 
base of neighbourhoods consisting of closed sets. Show that the space 
is regular. 

13.4.3 Suppose that f is a continuous mapping from a topological space 
(X,7) into a topological space (Y,o). The graph G'y of f is the set 
{(x, f(x)) : « € X}. Show that if (Y,c) is a T, space then Gy is 
closed in X x Y, when X x Y is given the product topology. Give an 
example to show that the 7 condition cannot be dropped. 

13.4.4 Suppose that (X,7) is a topological space. Show that the following 
are equivalent: 

(a) (X,7) is Hausdorff; 

(b) the diagonal{(z, x) : x € X} is closed in (X,7T) x (X,7); 

(c) whenever f and g are continuous mappings from a topological 
space (Y,a) into (X,r), the set {yEY: f(y)=g(y)} is closed 
in Y. 

13.4.5 Suppose that (X,7) is not a Hausdorff space, and that y and z cannot 
be separated by open sets. Let 7: X — X be the identity mapping. 
Show that i(x) > y and i(z) > zasu—y. 


13.5 Countability properties 


There are several countability properties that a topological space (X,7) 
might possess. We list the three most important of these: 


e (X,7) is first countable if each point has a countable base of neighbour- 
hoods; 

e (X,7) is second countable if there is a countable base for the topology; 

e (X,7) is separable if there is a countable subset of X which is dense in X. 


A metric space is first countable (the sequence (Nj /,(x))72 is a countable 
base of neighbourhoods of x) but need not be second countable (consider an 
uncountable set with the discrete metric). 

Here are some elementary consequences of the definitions. 


Proposition 13.5.1 (i) A subspace of a first countable topological space 
is first countable. 
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(it) A countable product of first countable topological spaces is first 
countable. 

(itt) A countable product of second countable spaces is second countable. 

(iv) A countable product of separable topological spaces is separable. 

(v) If f is a continuous mapping of a separable topological space (X,T) into 
a topological space (Y,a) then f(X), with the subspace topology, is separable. 
In particular, the quotient of a separable topological space is separable. 


Proof (i), (ii) (iii) and (v) are easy consequences of the definitions, and 
the details are left as exercises for the reader. 

(iv) We give the proof for a countably infinite product; the proof for a 
finite product is easier. Suppose that ((X;, 75) )Foa is a sequence of separable 
topological spaces, and that (X,7) = Te (43, 7). Let Cj be a countable 
dense subset of X;, for 1 < j < oo. If any X; is empty, then the product is 
empty, and therefore separable. Otherwise, choose y; € Cj for 1 < 7 < oo. 
We consider y = (Yj )Foa as a base point in the product. For 1 < 7 < oo, let 


Ag={2 eX 1a, = CG; for 1 <7 < 9, a =y; ford Sg}. 

Then (A;)?2, is an increasing sequence of countable subsets of X. Let A = 
§214;. A is countable; we show that it is dense in X. Suppose that « € X 
and that N € N;. Then there exists jo € N and neighbourhoods N; € Nz, 
for 1 < 7 < jo such that N D rele 0G). Since there exists a € Aj, such 
that a; € N; for 1 <j < jo, NNA is not empty. 


Here are some results concerning first countability. The last three show 
that in the presence of first countability, certain topological properties can 
be expressed in terms of convergent sequences. 


Proposition 13.5.2 Suppose that (X,7) is a first countable topological 
space, that A is a subset of X, and that x © X. 

(i) There is a decreasing sequence of neighbourhoods of x which is a base 
of neighbourhoods of x. 

(it) The element x is a limit point of A if and only there is a sequence in 
A \ {x} such that rz, > © as n> oo. 

(itt) The element x is a closure point of A if and only if there is a sequence 
in A such that tn > © asn—- oo. 

(iv) If f is a mapping from X into a topological space (Y,o) then f is 
continuous at x if and only if whenever (%p)°°1 is a sequence in X for which 
In > xX asn-—> oo then f(xy) > f(z). 
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Proof (i) Suppose that (Nj)72, is a countable base of neighbourhoods 
of z. Let M; = ri_, Mi. Then (M;)9°, satisfies the requirements; it is a 
decreasing base of neighbourhoods of «x. 

(ii) The condition is certainly sufficient. It is also necessary. Suppose 
that x is a limit point of A. Let {M, : k © N} be a decreasing 
countable base of neighbourhoods of x. For each & € N there exists 
Lp © (My \ {x}) NA. If N € N;, there exists k such that M;, C N. Then 
In € N\ {x} for n > k, and so rz, > © as n> ov. 

(iii) is proved in exactly the same way. 

(iv) If f is continuous, then the condition is satisfied. Suppose that the 
condition is satisfied, and that f is not continuous at x. Then there exists 
a neighbourhood N of f(x) such that f~'(N) is not a neighbourhood of 
a. Thus for each k € N there exists 2, € My \ f~'(N). Then zz > 2 as 
k — co, but f(z,) ¢ N for any k € N, and so f(rz) A f(x) as k > on. 


Here are some results concerning second countability. 


Proposition 13.5.3 (i) A second countable topological space (X,T) is first 
countable and separable. 
(ii) A metric space (X,d) is second countable if and only if it is separable. 


Proof (i) Suppose that (Uj), is a basis for 7. If « € X then 
{U; : x € U;} is a base of neighbourhoods for x, and so X is first countable. 
If U; #0, choose x; € U;. Then A = {x; : U; # O} is a countable subset 
of X; we show that A is dense in X. If x € X and N € Nj, there exists a 
basic open set U; such that x ¢ U; C N. Then 2; € ANU; C ANN, so that 
NOAFO. 

(ii) By (i), we need only prove that a separable metric space (X,d) is 
second countable. Let C be a countable dense subset of X. We shall show 
that the countable set B = {Njjp(z):n € N: x € C} is a base for the 
topology. Suppose that U is open. Let V = U{B € B: B CU}; V is an 
open set contained in U. Suppose that x € U; then there exists « > 0 such 
that N.(x) € U. Choose n such that 1/n < €/2. There exists c € C such 
that d(c,x) < 1/n. If y € Ny/,(c) then 


d(y,x) < d(y,c) + d(c,z) < 1/n+1/n<e, 


so that y € U. Thus x € Nj/,(c) C V. Thus U C V, and so U = V. 
Consequently B is a base for the topology. 
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Corollary 13.5.4 A subspace of a separable metric space is separable. 


Proof For it is a subspace of a second countable space, and so is second 


countable. But a second countable space is separable. 


This result does not extend to topological spaces (Example 13.6.11). 
We now prove two substantial theorems concerning second countable 
topological spaces. 


Theorem 13.5.5 A regular second countable topological space (X,T) is 
normal. 


Proof Let B be a countable base for the topology. Suppose that C and 
D are disjoint closed subsets of X. Let C = {U ¢ B: UNC = 0} and let 
D={U€B: UND = 9}; let Vi,Ve,... be an enumeration of C and let 
W,, Wo,... be an enumeration of D. For 1 < 7,k < ~, let 


j k 
P; = W; \ (Ur) and Qk = Vi \ (Ur 5 

i=1 i=1 
P; and Q, are open. If 7 > k then P;| OV; = 9, and so P| NQx = O. Similarly, 
PNQp = Vik > 7. Let P= Ure Pj, Q = UR21Q,. Then P and @ are 

open and, 
PNQ=Uj,nen(Pj 1 Qe) = 0. 

If x € C then, since (X,7) is regular, there exists W; € D such that « € Wj. 
But x ¢ V; for 1 <i <j, and so z € P; C P. Since this holds for all x € C, 
C C P. Similarly, D C Q. Thus (X,7) is normal. 


Recall that H is the set {0,1]N = jes I;, where I; = [0,1] for 7 € N, 
with the product topology. 


Theorem 13.5.6 (Urysohn’s metrization theorem) A regular second 
countable topological space (X,7) is metrizable. There exists a homeomor- 
phism f of (X,T) onto a subspace f(X) of H. 


Proof Since H is metrizable, and a subspace of a metrizable space is 
metrizable, it is sufficient to prove the second statement. Let B be a 
countable base for the topology. Let 


S={(U,V)€BxB:UCV}. 


S is a countable set; let us enumerate it as (5;)%°,. 

If s; = (U;,Vi) € S then by Urysohn’s lemma there exists a continuous 
mapping f; : (X,r) > [0,1] such that f;(x) = 0 if x € U; and f;(x) = 1 
if « ¢ V;. We can therefore define a mapping f : (X,rT) > H by setting 
(f(x)); = fi(a). Since each of the mappings f; is continuous, f is continuous. 
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We now use two very similar arguments to show first that f is injective 
and secondly that f~! : f(X) > X is continuous. Suppose that x 4 z. Since 
(X,7) is a Ty space, there exists V € B such that x € V and z ¢ V and, 
since (X,7) is regular, there exists U € B such that x € U and U CV. Then 
(U,V) € S; let (U,V) = s;. Then f;(x) = 0 and f(z) = 1. Consequently 
f(a) # f(z). 

It remains to show that f~! is continuous. Suppose that « € X and that 
N € N,. There exists V € B such that x € V C N, and, since (X,7T) is 
regular, there exists U € B such that e € U CU CV. Then (U,V) € S; let 
(U,V) = s;. Then f;(z) = 0 and f(z) =1lifz¢N. Let M={yEeH: yj < 
1}. Then M is an open neighbourhood of f(x), and if y€ Mn f(X) then 
f-l(y) € N. Thus f-!: f(X) > X is continuous. 


13.6 *Examples and counterexamples* 


(This section can be omitted on a first reading.) 


We now describe a collection of examples of topological spaces which 
illustrate the connections between the various ideas that we have intro- 
duced. The descriptions frequently include statements that need checking: 
the reader should do so. First, quotients can behave badly. 


Example 13.6.1 An equivalence relation ~ on R for which the quotient 
space is uncountable, and for which the quotient topology is the indiscrete 
topology. 


Define a relation ~ on R by setting x ~ y if x—y € Q. This is clearly an 
equivalence relation; let gq : R — R/ ~ be the quotient mapping. Each 
equivalence class is countable, and so there must be uncountably many 
equivalence classes. Suppose that U is a non-empty open set in R/ ~, so that 
q ‘(U) is a non-empty open subset of R, and so contains an open interval 
(a,b). If c € R there exists r € Q such that « —r € (a,b), and so q(x) € U. 
Thus U = q(R) = R/~. 

Next we consider the relations between the various separation properties. 


Example 13.6.2. A 7) space which is not Hausdorff. 


Let X be an infinite set, and let ty be the cofinite topology on X. The 
finite subsets are closed, so that (X,7/) is a T; space. If U = X \ F and 
V = X \G are non-empty open sets then UNV = X \ (F'UG) is non-empty. 
Thus (X,7;) is not Hausdorff. 


Example 13.6.3 A quotient of a closed interval which is a T, space but 
not Hausdorff. 
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° (1) 


q(0) q(1/2) = q (-1/2) "ach 


Figure 13.6a. A T\ space that is not Hausdorff. 


Define a partition of [—1,1] by taking {—1}, {0}, {1} and {-t,t}, for 
0 < t < 1, as the sets of the partition. (Fold the interval [—1,1] over, and 
stick corresponding points together, except for the points —1 and 1.) Let X 
be the corresponding quotient space and let q : [—1,1] + X be the quotient 
mapping. Give [—1, 1] its usual topology, and give X the quotient topology 
Tq. Since the equivalence classes are closed in [—1,1], X is a T, space. On 
the other hand, if U and V are open sets in X containing g(—1) and q(1) 
respectively, then there exists « > 0 such that 


iM) l=ex<xp<lPouny, 
and so X is not Hausdorff. 


Example 13.6.4 A separable first countable Hausdorff space which is not 
regular. 


Let X = R?, let L = {(z,y) € R? : x > 0,y = 0} and let P = (0,0). 
We define a topology 7’ on X by saying that U is open if U \{P} is open 
in R? in the usual topology and if P € U then there exists € > 0 such that 
N.(P)\L C U. The reader should verify that this is indeed a topology. (X, 7’) 
is separable, since the countable set {(r,s) € X : r,s € Q} is dense in X, 
and it is clearly first countable. Then r’ is a topology which is finer than the 
usual topology, and so (X,7’) is Hausdorff. Since P ¢ L, L is closed. Suppose 
that U and V are 7’-open subsets of X with P € U and L CV. Then there 
exists € > 0 such that N.(P) \ L C U. Then (€/2,0) € L C V, and so there 
exists 6 > 0 such that N5((€/2,0)) C V. Since (NV.(P) \ L) A N5((€/2,0)) is 
not empty, UNV is not empty. Thus (X,7’) is not regular. 


Example 13.6.5 <A quotient of the first countable space R which is 
separable and normal, but not first countable. 


We define an equivalence relation on R by setting « ~ y if « = y or if 
x and y are both integers; in other words, we identify all the integers. We 
consider the quotient space R/ ~, with the quotient topology. Let q: R > 
R/ ~ be the quotient mapping. Then R/ ~ is separable. It is also normal. 
Suppose that A and B are disjoint closed subsets of R/ ~. Suppose first that 
q(0) ¢ AUB. Then q7!(A) and q7!(B) are disjoint closed sets in R.\ Z, and 
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so there exist disjoint open subsets U and V in R.\ Z, such that g~!(A) C U 
and q~!(B) C V. Then q(U) and q(V) are open and disjoint in R/ ~, 
and A C q(U) and B C q(V). Suppose secondly that qg(0) € AU B, and 
suppose, without loss of generality, that g(0) € A. Then g~!(A) and q7!(B) 
are disjoint closed sets in R, and Z C q~!(A). Then there exist disjoint open 
subsets U and V in R, such that q~!(A) C U and q-!(B) C V. Then q(U) 
and q(V) are open and disjoint in R/ ~, and A C q(U) and B C q(V). Thus 
R/ ~ is normal. 

Now suppose that (N5)§o4 is a sequence of neighbourhoods of q(0) in 
R/ ~. Then for each j € Z there exists 0 < €; < 1 such that (j-—€;,j+€;) C 
q 1(N;). Let M = (—o00,1) U ( $219 — €;/2,9 + €;/2)). Then g(M) is a 
neighbourhood of (0) in R/ ~, and N; Z q(M) for j € Z. Thus (Nj)7, is 
not a base of neighbourhoods of q(0), and so R/ ~ is not first countable. 

There exist topological spaces which are regular, but not completely 
regular, but these are too complicated to describe here.! 

Before describing the next few examples, we need to prove an easy but 
important result about the usual topology on R. This is a special case of 
Baire’s category theorem, which is proved in Section 14.7. 


Theorem 13.6.6 (Osgood’s theorem) Suppose that (Un)°2, is a sequence 
of dense open subsets of R. Then N° ,Un is dense in R. 


Proof Suppose that (ao, bo) is an open interval in R. We must show that 
(ao, bo) (NPL, Un) is not empty. Since Uj is dense in R, the set (ao, bo) NU} is 
not empty. Since (ag, bo) NU is open, there exists a non-empty open interval 
(a1, 61) such that 


(a1, b1) C [a1, b1] C (a0, 00) NU. 


We now iterate the argument. Suppose we have defined non-empty open 
intervals (a;,b;) such that 


(45,05) © [aj, bj] S (aj—1, 0j-1) NU, 
for 1 <j <n. Then (an_1, bp-1) MU; is a non-empty open set, and so there 
exists a non-empty open interval (a,,,b,,) such that 


\ Ciracy Dias) Cc [Gnas Ba] Cc (Gi, Dg) NU pn. 


The sequence (a,,)°2.9 is increasing, and is bounded above by b,,, for each 
m € N, and so converges to a limit a. If n € N then ay < dni < a < 
bn41 < bn, so that a € (an, bp) C Un. Thus (ag, bo) A (NSU) F O. 


1 See Example 90 in Lynn Arthur Steen and J. Arthur Seebach, Jr., Countereramples in Topology, 
Dover, 1995. This is a wonderful comprehensive collection of counterexamples. 
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Corollary 13.6.7 Suppose that (C,)°72, is a sequence of subsets of R 
whose union is R. Then there exist n € N and a non-empty interval (c, d) 
such that (c,d) C Cn. 


Proof Tf not, each of the open sets U, = R \ Cy is dense in R, and 
ake eee 


In other words, C,, has a non-empty interior. 


Example 13.6.8 (The Niemytzki space) A separable first countable 
completely regular space (H,r’) which is not normal, and which has a 
non-separable subspace. 

Let H be the closed upper half-space H = {(x,y) € R?: y > 0}, let L be 
the real axis L = {(x,y) € R? : y = 0} and let U = H\ L be the open upper 
half-space. Let 7 be the usual topology on H. If (2,0) € LZ and € > 0 let 


Dit) ={(eere Ot uae toe ee}. 


D-,(«) is the open disc with centre (x, €) and radius ¢, and L is the tangent to 
D,(x) at (x,0). Let M.(x) = {(x,0)}UD,(a), and let T(x) be the boundary 
of D.(x) in U: 


Tin=4.0) 60 swe" 4 wee" Se I. 


If (z,0) € L let Mz = {M.(x) : € > O}, and let U = Ug oyerMz. Let o be 
the collection of all unions of sets in U/ and let 


P={VUS:Ver7,S Ec}. 


The reader should verify that 7’ is a topology; it is a topology on H finer 
than the usual topology. The two subspace topologies on U are the same, but 
if (z,0) € D then the sets in M, form a base of r/-neighbourhoods of (, 0). 


Tx) 


x L 


Figure 13.6b. The Niemytzki space. 
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(H,7') is separable, since the countable set {(r,s) € U : r,s € Q} is 
dense in H. It is first countable: for example, if (7,0) € L, the sequence 
(M1 n(x))p&1 is a base of neighbourhoods of (x, 0). 

(H,7’) is completely regular. Suppose that C is closed in H and that 
z € C’. We consider two cases. First, suppose that z © U. Then there exists 
N.(z) CU\C. Let f(y) = |ly — 2|| /e for y € N-(z), and f(y) = 1 otherwise. 
Then f is continuous, f(z) = 0 and f(c) = 1 for c € C. Secondly, suppose 
that z = (x,0) € L. Then there exists « > 0 such that M.(2) NC = 90. 
If w € D,(x), there exists a unique 0 < t < 1 such that w € T;.(x). Set 
f(w) = t, set f(z) = 0, and set f(w) = 1 if w ¢ M(x). Let us show that 
f is a continuous function on H. It is clearly continuous at every point of 
H other than z; since {w : f(w) < t} = M(x) for 0 < t < 1, f is also 
continuous at z. Further, f(z) = 0 and f(c) = 1 for cE C. 

The subspace L has the discrete topology, since M.(2) NL = {(x,0)}, for 
€ > 0. Since Z is uncountable, it is not separable. Further, it is closed in 
(H,7’). Thus any subset of L is closed in H. In particular, the sets 


A = {(z,0) : x irrational} and B = {(q,0) : q rational} 


are disjoint closed subsets of H. We shall show that if V and W are open 
sets in H such that A C V and BC W then VOW £ J, so that (H,7’) is 
not normal. 

For n € N, let A, = {x € R: (z,0) € Aand M)/,(x) C V}. Then 
Uc, An = {x € R: z irrational}. Hence 


R= (U%An)U | LJ {at 


qEQ 


This is a countable union of sets, and so by Corollary 13.6.7, the closure 
(in R, with its usual topology) of one of the sets contains an open interval 
(c,d). This clearly cannot be one of the singleton sets {q}, and so there exists 
n €N such that A, D (c,d). Suppose now that g € QN (c,d). Then there 
exists € > 0 such that M.(q) C W; we can suppose that « < 1/n. There 
exists x € A, such that |x — q| < «. Then (z,€) € M.(q)N Mj/p(x). Since 
M.(q) € W and Myjp(x) CV, VOW £9. 


Example 13.6.9 A normal topological space with a subspace which is not 
normal. 


Add an extra point P to the Niemytzki space (H,r’) to obtain a larger set 
H*, and define a topology t* in H* by taking as open sets those subsets 
V of H* for which 
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eVNH €7’, and 
e if PE V then L \ V is finite. 


Then (H,7’) is a topological subspace of (H*,7*) which is not normal. The 
7*-closed neighbourhoods of P form a base of neighbourhoods of P (why?), 
and so (H*,77) is regular. 

Suppose that A and B are disjoint closed subsets of (Ht,7*). If C is a 
closed subset of H* for which C/N L is infinite, then P € C. Thus either 
AN L is finite, or BNL is finite, or both; without loss of generality, suppose 
that AN L = F is finite. Since (H*,77*) is regular, there exist disjoint open 
subsets V; and W, such that F C Vj and B C W,. Now (BU L) \ {P} 
and A \ V; are disjoint, and are closed in H in the usual topology, and so 
there exist subsets V2 and W2 which are open in H in the usual topology 
such that A\ V, C V2 and (BUL)\ {P} C Wo. Then V = Vi U Vg and 
W = W,1 (We U {P}) are disjoint open subsets of (H*,7T), and A C V, 
B CW. Thus (Ht,77) is normal. 

In fact, every completely regular space is homeomorphic to a subspace of 
a normal space; this is too difficult to prove here. 


Example 13.6.10 A first countable separable normal topological space 
which is not second countable. 


Let B be the collection of half-open half-closed intervals [a,b) in R. This 
clearly satisfies the conditions for it to be the base for a topology r’ on R. 
(R,7’) is separable, since the rationals are dense; it is first countable, since 
the sets {[z,z+1/n) :n € N} form a base of neighbourhoods of x. Suppose 
that A and B are disjoint closed subsets of (R,7’). If a € A there exists a 
largest I, in (a,a +1] such that [a,l,)N B= 90; let U = Ugeala, la). Then U 
is an open set containing A and disjoint from B. Similarly, if b € B there 
exists a largest mp in (b, b+1] such that [b,m,) NA = Q; let V = Uses|d, ms). 
Then V is an open set containing B and disjoint from A. But if a € A and 
b € B then [a,l,) [b, mp) = 9, and soU NV = 9; thus (R,7’) is normal. 

Suppose that B’ is a base for the topology. For each x € R there exists 
B, € 8’ such that x € Bz C |z,2+1). But ifa Ay then B, # B,, so that 
Bb’ cannot be countable: (X,7’) is not second countable. 


Example 13.6.11 The product of two first countable separable normal 
topological spaces which is not normal, and which has a non-separable 
subspace. 


Consider (X,7) = (R,7’) x (R,7’), where (R,7’) is the space of the 
previous example. Then the line L = {(x,—ax) : x € R} is closed, and its 
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subspace topology is the discrete topology, so that L is not separable. All 
of the subsets of LZ are closed in X; an argument just like the one for the 


Niemytzki space shows that (X,7) is not normal. 


We 


can sum up some of our conclusions in a table. In the context of 


topological spaces, this shows that the choice of the word ‘normal’ is clearly 


not at all appropriate! 


subspace quotient countable uncountable 
product product 


T Yes No Yes Yes 
Hausdorff Yes No Yes Yes 
Regular Yes No Yes Yes 
Completely regular Yes No Yes Yes 
Normal No No No No 
First Countable Yes No Yes No 
Second Countable No No Yes No 
Separable No Yes Yes No 
Exercises 


13.6.1 


13.6.2 
13.6.3 
13.6.4 


13.6.5 


13.6.6 


Suppose that (z,)°2, is a convergent sequence in the space of 
Example 13.6.3. Show that it converges to one or two points. 

In Example 13.6.4, characterize the sequences which converge to P. 
In Example 13.6.5, characterize the sequences which converge to q(0). 
Show that an example similar to the Niemytzki space can be obtained 
by replacing the open sets D.(x) by triangular regions, of a fixed 
shape. Let 0 < a < 1. Let R(x) = {(u,v) EU: ju—2| < €,alu—2| < 
v < ae}, and replace the sets D,(x) by the sets R.(z). 

Suppose that f is a bounded real-valued function on [0,1] for which 
f(x) > 0 for all x € [0,1]. Use Osgood’s theorem to show that the 
upper integral fo f(x) dz is strictly positive. 

Show that the topology of Example 13.6.10 is finer than the usual 
topology on R. Characterize the convergent sequences. 


14 


Completeness 


14.1 Completeness 


The general principle of convergence played an essential role in the analysis 
on R; similar ideas are just as important in analysis on a metric space. As 
we shall see, these are not topological ideas. 

The definitions are straightforward. Suppose that (X,d) is a metric space. 
A sequence (x,,)°°., taking values in X is a Cauchy sequence if whenever 
€ > 0 there exists no such that d(am,2n) < € for m,n > no. 


Proposition 14.1.1 (i) A sequence (%p)°2, in a metric space which 
converges to a limit | is a Cauchy sequence. 

(ii) If a Cauchy sequence (a)°@, in a metric space has a subsequence 
(Ln, Je which converges tol, then tn + 1 as n — oo. 


Proof (i) Given € > 0 there exists ng such that d(rp,1) < €/2 for n > no. If 
m,n > no then d(am, tn) < d(@m,1)+d(l, rp) < €, by the triangle inequality. 

(ii) Given € > 0 there exists N such that d(am,2n) < €/2 for m,n > N, 
and there exists K with nx > N such that d(ap,,1) < €/2 for k > K. If 
n > nx then d(tn,l) < d(tn,%n,) + d(an,,1) < €, again by the triangle 
inequality, so that rz, > 1 as n— co. 


A metric space (X,d) is complete if every Cauchy sequence in X is con- 
vergent to a point in X. Thus the general principle of convergence says that 
R, with the usual metric, is complete. Let us give some examples. 


Theorem 14.1.2 A sequence («\™)°<, in R% or C4, with its usual met- 
ric, is a Cauchy sequence if and only if each of the coordinate sequences 
Cape for1<j<d, is a Cauchy sequence in R. 

Ré and C4, with their usual metrics, are complete. 
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Proof If (gr), is a Cauchy sequence in R@ (or C%) and 1 < j < d, 
then, since [eo - x\”)| < do(a™, a), the sequence (a) )°2, is a Cauchy 


sequence in R (or C). Conversely, if each of the sequences (x for 


1<j<d, is a Cauchy sequence, then there exists ng such that 


a 


[xo —s”| </d'/? for m,n > no and 1 <j <d. 


Thus do(a™, 2) < € for m,n > no, and (2°, is a Cauchy sequence. 

Thus if (gee, is a Cauchy sequence then each of the sequences 
Grey is a Cauchy sequence, and so converges to a limit x;, by the gen- 
eral principle of convergence. Hence x) + a = (%1,...2%q) as n > oo: thus 


R? and C%, with their usual metrics, are complete. 


Proposition 14.1.3 (i) A closed metric subspace A of a complete metric 
space (X,d) is complete. 
(ii) A complete metric subspace B of a metric space (X,d) is closed in X. 


oe) 


Proof (i) Suppose that (x,)°°., is a Cauchy sequence in A. Then (x,)°°, 
is a Cauchy sequence in X. Since (X,d) is complete, x, converges to an 
element / in_X as n > oo. But A is closed, and so! € A. Thus x, converges 
to an element of A, and A is complete. 

(ii) Let 6 be a closure point of B in X. Then there exists a sequence 
(b,)°2, in B which converges to b. Thus (b,,)°°, is a Cauchy sequence in B. 
But B is complete, and so 6, converges to a point / of B as n + oo. By the 
uniqueness of limits, 6 = /. Thus any closure point of B belongs to B, and 


so B is closed. 


Many of the metric spaces that we shall consider are spaces of func- 
tions. The next result, and its corollary, lie behind a great number of results 
concerning such spaces. 


Theorem 14.1.4 Jf S is a non-empty set and (Y,d) is a complete metric 
space then the space By(S') of bounded mappings of S into Y is complete 
under the uniform metric doo. 


Proof The proof follows a pattern common to many proofs of complete- 
ness. There are three steps. We start with a Cauchy sequence (f,,)°°,. First 
we identify what the limit f should be, secondly we verify that it is an 
element of By(S), and thirdly we prove that f, > f as n > oo. 

Suppose then that (f;,)°2, is a Cauchy sequence in (By(S),d..). Ifs € S 
then d(fm(s), fn(s)) < doo(fm; fn), and so (fn(s))°, is a Cauchy sequence 
in Y. Since (Y,d) is complete, there exists f(s) € Y such that f,(s) > f(s) 
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as n — oo. We claim that the function f : s > f(s) is in By(X), and that 
fn — f in the uniform metric. 

Take « = 1. There exists ng such that doo(fm, fn) < 1 for m,n > no, 
and so if s € S then d(fm(s), fn.(s)) < 1 for m > no. By Proposition 
13.3.2 (which we shall use repeatedly), d(fm(s), fno(S)) 4 d(f(s), fro (s)) as 
m — oo, and so d(f(s), fno(s)) < 1. Thus if s,t € S then 


U(f(s), F()) < AF (s); fro (8)) + Afro (8), fro (t)) + A fag (t), FO) 
< diam (fn. (S)) + 2, 


so that f € By(S). 

Finally we show that f, — f as n — oo. Suppose that € > 0. There 
exists n, such that doo(fm,fn) < €/2 for m,n > nj, and so ifs € S$ 
then d(fm(s), fn(s)) < €/2 for m,n > ny. Suppose that n > n1. Since 
d( f(s), fn(s)) > d( f(s), fn(s)) as m — oo, d(f(s), fn(s)) < €/2. Since this 
holds for all s € S, doo(f, fn) < €/2 < €. But this holds for all n > nj, and 
so fn > f asn—> oo. 


A Cauchy sequence in (By(S),do.) is called a uniform Cauchy sequence. 


Corollary 14.1.5 (The general principle of uniform convergence) If (X,7) 
is a topological space and (Y,p) is a complete metric space, then the space 
Cry(X,Y) of bounded continuous mappings of X into Y is complete under 
the uniform metric dx; a uniformly Cauchy sequence (fn)°@1 of bounded 
continuous functions converges uniformly to a bounded continuous function. 


Proof For Cy(X,Y) is closed in By(X), by Theorem 12.3.7. 


The first important theoretical point to make is that completeness is nota 
topological property. To see this, consider R and (—7/2, 7/2), each with the 
usual metric d. (R,d) is complete, by the general principle of convergence, 
but ((—7/2,7/2),d) is not, since it is not closed in R. Now consider the 
mapping j = tan~! from R onto (—7/2, 7/2). This is a homeomorphism; we 
use it to define a new metric p on R, setting p(x, y) = |j(x) — j(y)|. Then 7 
is an isometry of (R, p) onto (—7/2,7/2),d), and so (R, p) is not complete. 
But d and p are equivalent metrics, since 7 is a homeomorphism. Thus the 
two metrics d and p define the same topology on R: but (R,d) is complete, 
and (R, p) is not. 

We need a stronger equivalence to preserve completeness. A mapping f 
from a metric space (X,d) to a metric space (Y,p) is said to be uniformly 
continuous if for each « > 0 there exists 6 > 0 such that if 7,y € X and 
d(z,y) < 6 then p(f(x), f(y)) < «. The important feature of this definition 


14.1 Completeness 389 


is that while 6 may depend upon « (and usually does), it does not depend 
upon x or y. A Lipschitz mapping is uniformly continuous, and a uniformly 
continuous function is continuous. The real-valued function f(x) = 2? on R 
is an example of a continuous function which is not uniformly continuous. 


Proposition 14.1.6 Suppose that f is a uniformly continuous mapping 
from a metric space (X,d) into a metric space (Y, p). 

(i) If (an)P21 is a Cauchy sequence in X then (f(an))°24, ts a Cauchy 
sequence in Y. 

(ii) Suppose that (gn) is a sequence of functions from a set S to X which 
converges uniformly on S to a function g. Then f © gn converges uniformly 
on S to fog. 


Proof The proof of (i), which follows almost immediately from the defini- 
tions, is left as an exercise for the reader. The proof of (ii) is as easy. Given 
€ > 0 there exists 6 > 0 such that if d(x,x’) < 6 then p(f(z), f(2')) < «. 
There exists no € N such that ifn > ng then d(gn(s), g(s)) < 6 for alls € S. 
Thus p(f(gn(s)), f(g(s))) < € forn > no ands eS. 


A bijective mapping f from a metric space (X,d) onto a metric space 
(Y, p) is a uniform homeomorphism if f and f~! are both uniformly con- 
tinuous. Two metrics d and d’ on a set X are uniformly equivalent if the 
identity mapping 7: (X,d) > (X,d’) is a uniform homeomorphism. 


Corollary 14.1.7 If f is a uniform homeomorphism from a metric space 
(X,d) onto a metric space (Y,p) then (X,d) is complete if and only if (Y, p) 
is complete. If two metrics d and d' on a set X are uniformly equivalent 
then (X,d) is complete if and only if (X,d’) is complete. 


As an important example, let us consider a product X = []72,(Xi,di) of 
an infinite sequence of metric spaces. In Theorem 13.3.3 we constructed a 
metric p on X for which the p-metric topology is the product topology, and 
for which the cross-section mappings ky; are Lipschitz mappings. Inspec- 
tion of the construction shows that each of the coordinate mappings 7; is 
uniformly continuous. A metric d on X with all of these properties is called 
an uniform product metric. 


Corollary 14.1.8 Suppose that d is a uniform product metric on the prod- 
uct X = [],(Xi,d;) of non-empty metric spaces (X;,d;). Then (X,d) is 
complete if and only if each metric space (X;,d;) is complete. 


Proof This result holds for finite and infinite products. Suppose first 
that each metric space (X;,d;) is complete. Suppose that (ee is a 
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Cauchy sequence in (X, d). For each i, the sequence (7;(x‘)))°°, is a Cauchy 
sequence in (X;,d;), since the coordinate mapping 7; is uniformly continu- 
ous, and therefore converges to an element /; of X;. But this implies that 
a”) 5 | as n > oo, where I = (;). Thus (X,d) is complete. 

Conversely suppose that (X,d) is complete. Let (a())ox_, be a Cauchy 
sequence in (X;,d;). Choose y € Henig; X;. Since the cross-section 
mapping k,; is a Lipschitz mapping, it is uniformly continuous, and so 
(key a(a”)) 2° is a Cauchy sequence in (X,d). Since (X,d) is complete, 

(n) 
1 


key (aS) converges to an element | of X as n— oo, Then x; ’ > 7;(I) as 


n — oo, since 7; is continuous and a” = mi (Ky i(as”)). Thus (X;,d;) is 


complete. 


The next result provides a powerful test for completeness. 


Proposition 14.1.9 Suppose that X is a subset of a complete metric space 
(Y, p) and that d is a metric on X for which 

(i) the inclusion mapping j : (X,d) > (Y,p) is uniformly continuous, and 

(ii) for each x € X and each «€ > O the closed e-neighbourhood 
M.(a) = {a' € X : d(x’,x) < €} of x in X is p-closed in Y. 

Then (X,d) is complete. 


Proof Suppose that (2,)°°, is a Cauchy sequence in (X,d), and that 
€ > 0. There exists ng such that d(tm,%n) < €/2 for m,n > no. Since j 
is uniformly continuous, (x,,)?2, is a p-Cauchy sequence, and since (Y, p) is 
complete, there exists y € Y such that p(xp,y) +0 asn—-> oo. Ifm>n>n9 
then 2m € M./2(tn). Since M,/2(%n) is p-closed, it follows that y € M./2(xn). 
Thus y € X and d(ap,y) < €/2 < e. Since this holds for all n > no, tn > y 
as N — 00. 


We shall give applications of this in Proposition 14.2.3 and Corollary 
14.2.4. 

Next we prove a fundamental extension result. This depends in an 
essential way on the relation between uniform continuity and completeness. 


Theorem 14.1.10 Suppose that A is a dense subset of a metric space 
(X,d), and that f is a uniformly continuous mapping from A to a_complete 
metric space (Y,p). Then there is a unique continuous mapping f from X 
to Y which extends f: f(a) = f(a) fora € A. Further, f is uniformly 
continuous. If f is a Lipschitz mapping with constant K then so is f, and 


if f is an isometry then so is f. 
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Proof Suppose that « € X. There exists a sequence (a,)°72., in A such 
that a,x as n— oo. Then (a,,)°2, is a Cauchy sequence in A, and so 
(f(G@n))°2 is a Cauchy sequence in (Y,d). Since Y is complete f(a,,) con- 
verges to some element y of Y. We show that y does not depend on the choice 
of the approximating sequence (a,,)°°.,. Suppose that ai, > « as n > oo. 
Then, as before, there exists y’ € Y such that f(aj,) > y’ as n > ov. 
But then the sequence (a1, a‘, a2,a4,...) converges to x, and the sequence 


(f(a1), f(a), f(a2), f (a4), ...) converges in Y. The subsequences (f(an))° 1 
and (f (ai,))°, must therefore converge to the same limit; thus y = y! . We 


n=1 
set f(x) to be the common limit. We have thus defined the mapping f from 
X to Y; f clearly extends f (consider constant sequences). 

Next we show that fis uniformly continuous. Suppose that « > 0. There 
exists 6 > 0 such that if a,b € A and d(a,b) < 6 then p(f(a), f(b)) < «. 
Suppose that z,y € X and that d(z,y) < 6. Let 7 = 6 — d(x,y). There 
exist sequences (a,)°°, and (b,)°°, in A such that a, — x and bn > y as 
n — oo, and so there exists ng such that d(x,a,) < n/2 and d(y, bn) < n/2 
for n > ng. By the triangle inequality, 


(An; bn) < d(an, x) + d(x, y) + d(y, bn) < 6 


for n > ng, and so p(f (an), f(bn)) < € for n > no. Since f(an) > f(z) and 
f(bn) > f(y), it follows from Proposition 13.3.2 that pF (a ), fly) <e 
There is only one continuous extension. For if f and f are two con- 
tinuous extensions then {zéX:f(x)=f(x)} is a closed subset of X 
(Exercise 10.4.2) which contains A, and so contains A = X. 
If x,y € X there exist sequences (a,,)°2, and (b,)°2, in A such that 
Gn > x and b, > y as n — oo. Since f is continuous, 


f (an) = F(dn) + f(x) and f(bn) = fbn) + Fly) as n> 00. 


Thus o(f (x), f(y)) = limp +o p(f (an), f(bn)), by Proposition 13.3.2. Thus 
if f is a Lipschitz mapping with constant K, then 


p(f(«), f(y) < K Tim d(an,bn) = Kd(a,y), 


so that fi is also a Lipschitz mapping with constant A’. Similarly, if f is an 
isometry then olf (a ), f(y)) = = litt, $520, 0n) = dey), 80 That f is an 
isometry. 


We can characterize completeness in terms of sequences of sets. 
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Theorem 14.1.11 Suppose that (X,d) is a metric space. The following 
are equivalent. 

(i) (X,d) is complete. 

(it) If (An)°2, is a decreasing sequence of non-empty closed subsets of X 
for which diam (A,,) + 0 as n + 00 then NPL, An is non-empty. 

If so, then N°°, An is a singleton set {a} and if an € An for each n then 
Qn >a asn—-> oo. 


Proof Suppose first that (X, d) is complete, and that (A,,)°°, is a decreas- 
ing sequence of non-empty subsets of X for which diam A, > 0 as n > oo. 
Pick a, € Ap for each n € N. We shall show that (a,)°°, is a Cauchy 
sequence. Suppose that « > 0. There exists ng such that diam (A,,) < € for 
n>no. Ifm>n> no then am € An, so that d(am, an) < €. Thus (a,)°, 
is a Cauchy sequence; since (X,d) is complete, it converges to an element 
a of X. Suppose that n € N. Since a, € A, for m > n, and since A, is 
closed, a € Ap. Thus a € MP2, An. Further, diam (N°°,An) < diam Am, for 
each m € N, so that diam (N°2,A,) = 0. Thus A = {a}. 

Conversely, suppose that (ii) holds and that (x,) is a Cauchy sequence 
im (X,d). Let Ty, = (2, %aai,>..}: Wa )ey 16 the teil sequence. Since (z,,) 
is a Cauchy sequence, diam (T,) > 0 as n > oo. Let Fy = Tn. (Fn)&, is 
a decreasing sequence of non-empty closed sets, and diam (F;,) = diam (T,,) 
(Proposition 12.3.3), so that diam (F;,) — 0 as n > oo. Thus °F, is 
non-empty. Suppose that x € °°, Fy. Since x € F,, for each n, d(x, an) < 
diam F,,, and so d(z,z,) + 0 as n > oo. Thus z, > x as n > ov, and 
(X,d) is complete. 


Corollary 14.1.12 Suppose that (Ac)o<ce<e, is a family of non-empty 
closed sets of a complete metric space (X,d), with Ae C Ae for0<e< 
e <9. If diam A, > 0 as € + 0 then Noce<e,Ae is non-empty, and is a 
singleton set. 


Proof  No<e<epAe = N21 Bn, where By = Ag, /n- 


Although completeness is not a topological property, it has topological 
consequences. It is therefore of interest to know when there is an equivalent 
complete metric on a metric space. A metric space (X,d) is said to be 
topologically complete if there is an equivalent metric p on X for which 
(X,p) is complete. We use uniform product metrics to provide information 
about this. 

A subset of a topological space (X,T) is a Gs set if it is the intersection 
of a sequence of open sets (and is an F,, set if it is the union of a sequence 
of closed sets). If (X,7) is T, space, then the complement of a countable set 
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is a Gs set. Thus the set of irrational numbers in R is a Gs subset of R and 
the set of transcendental numbers in C is a Gs subset of C. The following 
theorem is therefore quite surprising. 


Theorem 14.1.13 Suppose that (X,d) is a complete metric space, and 
that A is a Gs subset of X. Then the metric subspace A is topologically 
complete. 


Proof First suppose that A is open. If A = X, we can take p = d. Oth- 
erwise, let X x R have a uniform product metric 0. Then (X x R,o) is 
complete. Consider the injective mapping f from A to X x R defined by 
f(a) = (a,1/d(a, C(A)). Since d(a, C(A)) > 0 for a € A, this is well-defined. 
Since the mapping a — 1/d(a,C(A)) is continuous on A, f is continuous. 
We show that f(A) is closed in X x R. Suppose that 


f(a@n) > y = (2,A) € X x Ras n- ov, 
so that a, > x and 1/d(a,,C(A)) > as n > on. Since 
|d(a@m,C(A)) — d(an,C(A))| < d(am, an), 


the sequence (d(an,C(A)))°2, is bounded, so that 1/d(a,,C(A)) does 
not tend to 0 as n->oo. Thus A¥40, and d(a,,C(A))-1/A. Since 
d(an,C(A)) > d(x, C(A)), it follows that d(a,C(A)) = 1/A, and so z € A. 
Consequently f(a,) > f(x) as n > oo, so that f(A) is closed in X x R, 
and (f(A),a) is complete. The mapping f : (A,d) > (X x R,c) is continu- 
ous, and the inverse mapping f~!: f(A) > A is a Lipschitz mapping with 
constant 1, since d(a,a’) < o(f(a), f(a’)), so that f is a homeomorphism of 
(A, d) onto (f(A),o). Thus if we define p(a,a’) = o( f(a), f(a’)) then p isa 
complete metric on A equivalent to d. 

Next, suppose that A = N72,U; is a G5 set. For each j there is a metric 0; 
on U;, equivalent to the restriction of d to U;, under which U; is complete. 
Let U = WG, oj), and let o be a uniform product metric on U. Then 
(U,c) is complete. If a € A, let i; : A + U; be the inclusion map and let 
i: A— U be defined as i(a) = (i;(@))72,. Then 7 is a continuous injective 
map of (A,d) into (U,a). We show that 7(A) is closed in U. Suppose that 
(an) —> u as n — oo Then, for each j € N, i;(an) 4 uj as n > oo. Since 
the metric a; on U; is equivalent to the metric d, d(an,u;) + 0 as n > oo. 
Since this is true for each j, there exists | in X such that u; = 1 for each 
j € N. Thus 1 € 992.,U; = A, and u = i(l). Thus i(A) is closed in (U,o), 
and so (i(A), a) is complete. If i(a,) > i(a) in (i(A), 0), then i1(a,) — 24(a) 
in (U,,0,) as n > oo. But d and oj are equivalent on Uj, and so a, > a in 
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(A,d). Thus i~! : (i(A), a) > (A, d) is continuous, and i is a homeomorphism 
of (A, d) onto (i(A),o). Thus if we define p(a,a’) = o(i(a), i(a’)) then pisa 


complete metric on A equivalent to d. 


14.1.1 


14.1.2 


14.1.3 


14.1.4 


14.1.5 


14.1.6 


14.1.7 


14.1.8 


14.1.9 


14.1.10 


Exercises 


Give an example of a bijection 7 from a metric space (X,d) onto a 
metric space (Y,p) which is uniformly continuous, while j~/ is not 
continuous. 

Give an example of a bijection j from a metric space (X,d) onto 
a metric space (Y,) which is uniformly continuous, while j~! is 
continuous, but not uniformly continuous. 

Suppose that f is a differentiable function on R. Show that f is 
uniformly continuous if the derivative f’ is a bounded function on 
R. Does the converse hold? 

Suppose that Y is a dense subset of a metric space (X,d) and that 
any Cauchy sequence in Y converges to an element of X. Show that 
(X,d) is complete. 

Define a metric p on N by setting p(m,n) = |1/m — 1/n|. Suppose 
that (ap,)°2, is a sequence in a metric space (X,d). Set f(n) = rp. 
Show that (x,)°°., is a Cauchy sequence if and only if f : (N,p) > 
(X,d) is uniformly continuous. 

Suppose that f is a continuous bijection from a complete metric 
space (X,d) onto a metric space (Y,p) and that f~! is uniformly 
continuous. Show that (Y,p) is complete. 

Ife e X =[]¥2,{0, 1};, let f(z) = 230%, z;/3). Show that f isa 
uniform homeomorphism of X onto Cantor’s ternary set. 

The set {0,1} becomes an abelian group when we define 0 + 0 = 
14+1=0,0+0=0+1=1. Use this to define an abelian group 
structure on X = eae uaa re Show that the mapping (2,y) > 
at+ty:X xX > X is jointly uniformly continuous. What about the 
mapping « > —2? 

Suppose that (X,d) is a metric space which is not complete. Show 
that there is an unbounded continuous real-valued function on X, 
and a bounded continuous real-valued function on X which is not 
uniformly continuous. 

Suppose that (x,,)°°., is a sequence in an ultrametric space (X, d) for 
which d(tp,%n+41) > 0 as n > oo. Show that (rp)°°, is a Cauchy 
sequence. 
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14.2 Banach spaces 


A normed space (£,]|].||) which is complete under the metric defined by 
the norm is called a Banach space, after the Polish mathematician, Stefan 
Banach. Similarly, an inner-product space which is complete under the met- 
ric defined by the inner-product norm is called a Hilbert space, after the 
German mathematician, David Hilbert. 


Proposition 14.2.1 Jf (E,||.||) is @ Banach space then the normed space 
(Bz(S),||-l|,) of bounded functions on S taking values in E is a Banach 
space. 


Proof This is an immediate corollary of Theorem 14.1.4. 


In particular, the spaces (Br(S), ||.||,,) and (Bc(S), ||.||,,) of bounded 
real-valued functions and bounded complex-valued functions on S are 
Banach spaces. 


Proposition 14.2.2 If (X,7) is a topological space and (E,||.||) ts a 
Banach space then (Cp(X, E), ||.||,,) 1s @ Banach space. 


Proof This is a special case of the general principle of uniform conver- 
gence. 


Proposition 14.2.3 Suppose that (F,||.||-) is a Banach space, and that E 
is a linear subspace of F equipped with a norm ||.||_ for which the inclusion 
mapping (E,||-\|~) > (F)|l-llp) ts continuous. If the closed unit ball By = 
{zt € E: |\a\|~ < 1} ts closed in (F;,||.||-) then (E£,||.||,2) 1s @ Banach space. 


Proof The inclusion mapping is uniformly continuous, and M,(x) = «+ 
eBr is closed in (F,_||.||-), since translation and multiplication by non-zero 
scalars are homeomorphisms. The result therefore follows from Proposition 
14.1.9. 


Corollary 14.2.4 The space (l1,||.||,) is a Banach space and the space 
(Io, ||.||,) is @ Hilbert space. 


Proof We give the proof for lg: the proof for J, is exactly similar. lg is a 
linear subspace of the Banach space (oo, ||.||,,), and the inclusion mapping 
(da, ||-llo) —* (loos |I-I],,) iS continuous, since ||z||,, < ||z||, for x € lo. It is 
therefore sufficient to show that B), is closed in (loo, ||.||,,). Suppose that 


(aes is a sequence in B),, which converges uniformly to x € I. Then 
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ai”) — ©; as n — oo, for each 7 € N, and so if k €¢ N then 


k k 

Ds (n) 2 
yo les? = lim Se" P sh 
j=l j=l 


Since this holds for all k ¢ N, )7°, |vj|? <1, so that x € By. 


As we shall see, there are many other applications of Proposition 14.2.3. 

We can consider infinite series in a normed space, and these can be used 
as a test for completeness. A series 0 a; in a normed space (£,||.||) is 
said to converge absolutely if LO ||a;|| converges. 


Proposition 14.2.5 Suppose that (aj) Foy is a sequence in a Banach space 
(E,||.||) for which Lo a; converges absolutely. Then Si a; converges, 


and | X32 43|| < Xo las 
Conversely, if every absolutely convergent series in a normed space 
(E, ||.||) converges then (E, ||.||) is complete. 


Proof Suppose that « > 0. There exists ng such that if n>m>np then 
j=m+1 llajl| <€. Let sn = )0%_, aj. By the triangle inequality, ifn > m > 


no then 
n 


\Sn — Sm] = [lama + +++ + nl] S » llajll <«, 
j=m+1 
so that (s,) is a Cauchy sequence in (£,||.||). Since (£,||.||) is complete, 
there exists s € & such that s, > s as n > co; that is, ei a; = s. Since 
the function x — ||z|| is continuous on E, 


[oe n (oe) 

| = < Jim, Dllajll =D lla. 
> 4j|] = lim. [Isl] < im, D7 llasll = D7 lal 
j=0 j=l o=F 


Conversely, suppose that every absolutely convergent series in (£, ||.||) 
converges. Let (%,,)°°, be a Cauchy sequence in (E,||.||). There exists a 
strictly increasing sequence (15 )F24 in N such that ifn > m > nj; then 
[te = Pig || 1/2?» Let a; =ae,,, and leb.ap = oy, — ea tory > 1. Then 
l|a; || < 1/291 for j > 1 and so ye j1 llaj|| < oo. Thus }75°, aj converges, to 
s say. But ae a; =Zn,, and 80 Ln, > 8 as N— oo. Thus 2, > 8 as nN > o0, 
by Proposition 14.1.1, and so (£,||.||) is complete. 


As special cases, we have the following corollaries. 
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Corollary 14.2.6 (Cauchy’s test for Banach spaces) The series 
je aj converges if limsup;_yoc Ila; [1/2 <1 and it does not converge if 


. 1/3 
lim supj_5o0 ||@;|| I> 1. 


Corollary 14.2.7 (D’Alembert’s ratio test for Banach spaces) Sup- 
pose that a; # 0 for all j. If limsup;_,., ||aj-+1| / ||aj|| <1 then D7) a; 
converges. If limsup;_,o ||aj-+11| / |aj|| > 1 then S052, aj does not converge. 


Corollary 14.2.8 (Weierstrass’ uniform M test) Suppose that (X,rT) is 
a topological space, that (E,||.||) 1s a Banach space and that (fj)72, is 
a sequence in (Co(X,F),||-\|.)) F Ifill, < Mj for each j € N, and 


24 M; < ov, then the series aren fj (x) converges uniformly to a bounded 
continuous function on X. 


When the conditions of this corollary are met, we say that the series 
Li fj (x) converges absolutely uniformly on X. 
The following special case is particularly useful. 


Corollary 14.2.9 Suppose that, for each j EN, Ges is a sequence 
in a Banach space (E,||.||) which converges to an element l, of E, that 


ie < M; for each n € N and each j € N, and that ya Mj < oo. 


Then the series ay he converges uniformly on N to a sequence (Sn)724 


in E, the series Sat lL; converges to an element 84. of E, and 8, — 8 as 
n> OO. 


Proof Let N = N U {oo} with the metric d(m,n)=|1/m—1/nl, 
d(n, +00) = 1/n. Let gj(n) = A for n € N, and let g;(co) = 1;. Then 
9; € O(N, £), and 9s lloo < Mj. Thus jet gj; converges uniformly to an 
element s of C,(N, £); this gives the result. (It is as easy to prove the result 
directly.) 


There is also a test for products. 


Corollary 14.2.10 (Weierstrass’ uniform M-test for products) Sup- 
pose that (X,T) ts a topological space and that (fj)?21 ts a sequence in 
(Co(X,R), |I-llo)- F ll fill. < My for each j € N, and SY°,M; < co, 
then the infinite product hare! + fj(x)) converges uniformly to a bounded 
continuous function on X. 
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Proof Since M; — 0 as j — oo there exists N € N such that M; < 1/2 
for 7 > N. We use the mean-value theorem. If |t| < 1/2, then 


0< © ogi +t) <2, so that |log(1 + fj(x))| < 2|fj(x)| < 2M; 
for 7 > N andz € X. Thus aN log(1+ f;(x)) converges absolutely and 
uniformly to a bounded continuous function g(x). The exponential func- 
tion exp has a bounded derivative on [— ||g||,, , ||g||,,], and so is uniformly 
continuous on [— ||g||,, 5 || ll o]- It therefore follows that []j-v4,(1 + fj(x)) 
converges uniformly to e!*). Finally, []j210 + fi(x)) converges uniformly 


to (Tjii(1 + fi(a)))e™. 


Exercises 


14.2.1 Suppose that (£, ||.||) is a real normed space. Show that the mapping 
(x,y) > «+y: Ex E > E is uniformly continuous, but that the 
mapping (A,2) > Av: R x EF > E is continuous, but not uniformly 
continuous. 

14.2.2 (Hardy’s test) (a) Suppose that (a;)72o is a null sequence of real or 

complex numbers for which } 05° 9 |aj—aj41| < 00 and that (b;)%2o is a 
sequence in a Banach space (F, ||.||) for which the sequence of partial 
sums () 7/9 b;)ro is bounded. Show that 577°.) a;b; converges. 
(b) Suppose that (a;)j2 is a null sequence of real or complex num- 
bers for which the sequence of partial sums 0 aj) 7-9 is bounded 
and that (bj)?2o is a sequence in a Banach space (£,||.||) for which 
> F=0 llbj — b;41]] < 00. Show that 51529 a;b; converges. 

14.2.3 (Hardy’s uniform test) Suppose that (aj) Zo is a sequence in the real 
or complex Banach space (B(S), ||.||,,) for which 


[oe) 
S- |laj — @441lloo < 00 
j=0 


and that (65) j20 is a null sequence in B(S') for which the sequence 
of partial sums (}7%_9b;)rio is bounded. Show that 05° 9 ajbj 
converges uniformly. 

14.2.4 (Dirichlet’s test) Suppose that (a;)72o is a decreasing null sequence 
of positive numbers and that (b;)72 is a sequence in a Banach 
space (E,||.||) for which the sequence of partial sums (oj=0 by) =H 
is bounded. Show that pa aj;b; converges, to s say. Show further 


14.2.5 


14.2.6 
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that if 
m 
Sm = S- a5); and M = sup Sob; 
j=0 ” |/7=0 


then ||s — 8 m|| < 2am4iM. 

(Abel’s test) Suppose that (a;)72o is a decreasing sequence of positive 
numbers and that (b;)72o is a sequence in a Banach space (F, ||.||) 
for which }77°9 6; converges. Show that )°7°.9 ajbj converges. 

This exercise and the next one give two classical applications of Weier- 
strass’ uniform M-test. Formulae such as these go back to Euler. Let 
P(0) =1 and 


Py (i) = (1- cri) (:- ca) _ (.- a) 


for 1 <j < k. Show that 


1 . i@ \ 2k+1 ‘ jg. \ 2k+1 
2% 2k+1 2k+1 


_ 3 ar Pj) 


Let X = {0} U {1/(2k +1): k EN} CR. Let 


f,(0) = <n, 


(27 + 1)! 
1 (—1)/ \ a2j-+1 
. = ee f <j<k 
=0 forj>k. 


(a) Show that f; is a continuous function on X. 
(b) Show that |) fj||,, = |0)2t"/(27 + DI. 
(c) Use Weierstrass’ uniform M-test to show that 


1 ; n 70 2k+1 _ es i0 2k+1 , og 
i +1 +1 . 


as k > oo. 
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14.2.7 Let 
aoa for y = 0, 
1+ cos 2jym 1+ cos 2jy7 : 
Loa. 9 |) =| |) ey 19, 
1 — cos 2jym 2sin* 7yT 
0 for 1/27 <y<1. 


(a) Show that f; € C[0, 1]. 

(b) Show that || fj||,, < 1/2772. (Use the inequality sin 0 > 20/7 for 
0<¢ < a2.) 

(c) Use Weierstrass’ uniform M-test to show that 


ss 9g (1+ cos 2jyn | 
Y \T—cos Qjyn * 2 Pu? 
j=l j=l 


asy \, 0. 
14.2.8 Give a direct proof of Corollary 14.2.9. 


14.3 Linear operators 


When we consider linear mappings between normed spaces, then continuity 
and uniform continuity are the same. Indeed, we can say more. 


Theorem 14.3.1 Suppose that (Fj, ||.||,) and (£2, ||.||,) are normed spaces 
and that T is a linear mapping from E, to E>. The following are equivalent: 

(i) K = sup{||T2ll9 : tlh, < 1} < co; 

(ii) there exists C € R. such that ||Tz||. < C'||z||,, for all x in Ey; 

(itt) T is Lipschitz; 

(iv) T is uniformly continuous on Ey; 

(v) T is continuous on FE}; 

(vi) T is continuous at 0. 


Proof (i) implies (ii): (ii) is trivially satisfied if ¢ = 0. Otherwise, let 
x1 = 2/ ||z||,. Then 


IT@)lo= IF elh elle = Mle Pls = lel IP@lle < * Mah - 


(ii) implies (iii): ||7\(v1) — T(x2)||2 = ||[T'(41 — #2)||2 < C lle — val, . 
Obviously (iii) implies (iv), (iv) implies (v) and (v) implies (vi). 
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(vi) implies (i): There exists 6 > 0 such that if ||a||, <6 then |/T(«)|], < 1. 
If ||z||,; <1 then |/d2]|, < 6, so that 


IT@)lly = | T(6x)||, = 3" Tx) Il < O*. 


We denote the set of continuous linear mappings from V; to V2 by 
L(V,, V2). We write L(V) for L(V,V). A continuous linear mapping from 
VY, to V2 is also called a bounded linear mapping, or a linear operator; a 
continuous linear mapping from V to itself is called an operator on V. 

Two norms ||.||, and ||.||, on a vector space FE are equivalent if the 
corresponding metrics are equivalent. 


Corollary 14.3.2 [f||.||, and ||.||, are equivalent norms on a vector space 
E then (E,||.||,) is @ Banach space if and only if (E,||.||,) is. 


Proof For they are uniformly equivalent, and so the result follows from 
Corollary 14.1.7. 


We have the following extension theorem. 


Theorem 14.3.3 Suppose that F is a dense linear subspace of a normed 
space (E,||.||,), and that T is a continuous linear mapping from F to a 
Banach space (G,||.||@). Then there is a unique continuous linear mapping 
T from E to G which extends T: T(y) = T(y) for y € F. If T is an isometry 
then so is T. 


Proof By Theorem 14.3.1, T is uniformly continuous, and so by Theorem 
14.1.10 there is a unique continuous extension T. which is an isometry if T 
is. We must show that T is linear. Suppose that x,y € & and that a, (6 are 
scalars. There exist sequences (%,)°°, and (yp)°2, in F such that rz, > x 
and yn > yas n— oo. Then ary + Byn > ax + By as n > co, and so 


T(axz + By) = lim T(atn + BYn) = lim T (arn + BYn) 
jim (aT (tn) + BT (Yn)) =o lim Tan) + 6 lim T(yn) 


=o lim T(tn) +8 Jim, T (Ym) = oT (x) + BT (y). 


Theorem 14.3.4 (i) L(Vi, V2) is a linear subspace of the vector space of 
all linear mappings from V, to V2. 
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(ti) If T € L(V, V2), set ||T|| = sup{||T()I[ : lla], < 1}. Then ||T'|| ts a 
norm on L(V,, V2), the operator norm. 
(iii) IFT € L(Vi,Va), and « € Vy then |IT(a)lly < IITII-llell- 


Proof. (i): We use condition (i) of Theorem 14.3.1. Suppose that 
S,T € L(V, V2) and that a is a scalar. Then 


sup{||(@7’)(x) || = [al], S 1} = la] supt{|T(w)Ilo = [lall, < Up, 
so that aT € L(Vj, V2) and 


sup{||(S + T)(«)llo = llall, < 1 
S sup{||S(@)llo : Hlall, < 1 + sup{|lT@)llo = lel < U, 


so that S+ T € L(V, V2). 
(ii): If ||T'|| = 0, then T(x) = 0 for x with ||z|| < 1, and so T(x) = 0 for all 
x: thus T = 0. ||aT|| = |a| ||Z'| and ||S + T|| < ||S|| + |||], by the equation 
and inequality that we have established to prove (i). 
(iii): This is true if 2 = 0. Otherwise, let y = 2/ ||x||,. Then ||y||, = 1, so 
that 
Talo = IT lely Mlle = lel IPMlle SIT Mell - 


Theorem 14.3.5 If (£4,||.||,) 1s a@ normed space and (E£o,||.\|,) is a 
Banach space then L(E\, F2) is a Banach space under the operator norm. 


Proof The proof is like the proof of Theorem 14.1.4. Let (Z;,) be a Cauchy 
sequence in L(E), E2). First we identify what the limit must be. Since, for 
each « € Fy, ||Tn(x) —Tm(z) lo < |[Zn — Imll llzllq, (n(z)) is a Cauchy 
sequence in £2, which converges, by the completeness of E2, to T(x), say. 
Secondly, we show that T is a linear mapping from FE, to E>. This follows, 
since 


T(ax + By) — aT (x) — BT(y) = lim (Tr(ax + By) — oTn(x) — BTn(y)) = 0, 


for all x,y € FE, and all scalars a, 8. Thirdly we show that T is continuous. 
There exists N such that ||T, — Tin|| <1, for m,n > N. Then 


\(F — Tv)(2)lly = lim, (Tn - Tw)(@)lly < lll, 
for each x € Fj, so that T — Ty € L(F4, E2). Since L(E;, E2) is a vector 


space, T = (IT — Tw) +Ty € L(F\, Ea). Finally we show that T, > T. 
Given € > 0 there exists M such that ||T, — Tn|| < €, for m,n > M. Then 
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ifm > M, anda ce Ey, 


I(T —Tm)(#)|ly = lion ||(Ta — Tm)(2YIl <€llall 


so that ||T — Tin|| < €. 


A linear functional on a vector space V is a linear mapping from V into 
the underlying scalar field. The vector space of continuous linear functionals 
on a normed space (F,||.||) is called the dual space E’; it is given the dual 
norm ||@||' = {sup |¢(zx)| : |x|] < 1}. This is simply the operator norm from 
(E, ||.||) into the scalars. 


Corollary 14.3.6 The dual space (E’,||.||') of a normed space (E,||.||) is 
a Banach space. 


Let us consider one important example. We need a definition. A mapping 
T from a complex vector space E into a complex vector space F' is conjugate 
linear if 


T(x+y)=T(x)+T(y) and T(axr) = aT (x) for z,ye E,a eC. 


Theorem 14.3.7 (The Fréchet—Riesz representation theorem) Suppose 
that H is a real or complex Hilbert space. If x,y € H, let ly(a) = (x,y). 
Then ly € H', and the mapping |: H — H' is an isometry of H onto H'. It 
is linear if H is real and is conjugate linear if H is complex. 


Proof | We consider the complex case: the real case is easier. The function 
ly is a linear mapping of H into C. Since 


Idy(2)| =| (@,9) |S lel -Ilyll, 


ly € H! and |\ly|’ < llyll. If y 4 0 then l,(y/ lyll) = llyll, so a yl’ = llyll- 
/ 

Thus ||2y||" = [yl]. Clearly ly, +y. = ly, + lyo, 80 that [lly. — ly |! = Ily1 — yall, 

for y1,y2 € H: lis an isometry of H into H’. Since 


ley (x) — (x, ay) = a (x,y) = al,y(x), 


1 is conjugate linear. 

The important part of the proof is the proof that | is surjective: every 
continuous linear functional ¢ on H is represented in terms of the inner 
product; there exists y € H such that ¢(x) = (x,y) for all x € H. 

If ¢ = 0, then ¢ = Ip. Otherwise, by scaling, we can suppose that ||@||’ = 1. 
First we show that there is a unique y in the closed unit ball B of H such that 
o(y) = 1. We use Theorem 14.1.11. Let A, = {2 € B: RG(x) > 1 -1/n}. 
Since ||¢||’ = 1 = sup,ep |¢(z)|, An is non-empty, and clearly (Ap)%, is 
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a decreasing sequence of closed sets. Suppose that 21,72 € Ap. By the 
parallelogram law, 


lex + all? + ler — wall? = 2(|leal|? + |laall’) < 4. 


Since Ro(x1 + x2) = RKG(a1) + KG(w2) > 201-1 /n), ||v1 + ze|| = 201—-1/n). 
It follows from this that 


\|z1 — all? < 4 —4(1 —1/n)? = 8/n—4/n? < 8/n. 


Thus diam (A,,) > 0 as n > ov, so that NneNAn is a singleton y. Then y is 
the unique element of B for which ¢(y) = 1. 

We now show that (x) = (a, y) for all x € H. Let w = x — (x,y) y. Then 
(w,y) = 0, so that w is orthogonal to y. Suppose that ¢(w) = re’’ # 0. If 
t > 0 then 


; 2 
1+ 2rt+r7e? = (o(y +e“ tw))? < ly + etl =1+?? |\w||”, 


so that 2r < t(||w||? — r?). Since this holds for all positive t, ||w|]? > r2. Set 
t =r/(|\w||? —r?): then 2r <r, giving a contradiction. Thus ¢(w) = 0: that 
is, 


(2) = bw) + (x,y) OY) = (#9) - 


This has the following consequence. 


Theorem 14.3.8 Suppose that H and K are Hilbert spaces and that T € 
L(H,K). Then there exists a unique S € L(K, H) such that 


(T(x), y) = (2, S(y)) for alla ec Hye K. 
Further, ||S|| = ||T'|. 
Proof Suppose that y € K. Then the mapping x > (T(), y) is a contin- 
uous linear functional, and so there exists a unique element, S(y) say, in H 


such that (T(x), y) = (x, S(y)).Then 


(x, S(y1 + y2)) = (T(x), y1 + y2) = (T(x), m1) + (T (2), y2) 
= (x, S(y1)) a (x, S(y2)) ) 
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so that S(yi + y2) = S(yi) + S(y2), and 
(x, S(ay)) = (T(z), ay) = a (T(z), y) = @ (2, S(y)) = (x, aS(y)) , 


so that S(ay) = aS(y); S is a linear mapping. 
If y € K then 


ISP = (Sy), SY) = (LS(Y),¥) < ITI -NS@)I- yl, 


so that ||S(y)|| < ||Z'|| . |ly||. Thus S is continuous, and ||S|| < ||T||. Similarly, 
if « € A then 


so that ||7'(x)|| < [S| |||], and ||T'l| < [|S]. Hence] S|] = 


Za)? = (La), T(@)) = (a, ST(e)) < ISI] -T(@)IL- lel 


ITI. 


S is the adjoint of T. If H and K are real, then S is denoted by 7’, and 
if H and K are complex, then S is denoted by 7™. 


14.3.1 


14.3.2 


14.3.3 


14.3.4 


Exercises 


Suppose that (F,||.||~), (F) ||-llp) and (G;,||.||~) are normed spaces 
and that B is a bilinear mapping from EF x F into G. Show that B is 
continuous if and only if there exists M > 0 such that ||B(a,y)\|o < 
M |lellg llylly for all (w,y) € Bx F. 

Suppose that (£,||.||_-), (F) ||.) and (G, ||.||~) are normed spaces. 
If T € L(E, L(F,G)), and x € E, y € F, let j(T)(x,y) = (T(z))(y). 
Show that 7(7’) is a continuous bilinear mapping from E x F into G. 
Show that j is a bijective linear mapping of L(E£, L(F,G)) onto the 
vector space B(E, F';G) of continuous bilinear mappings from E x F 
into G. If b € B(E, F;G), let 


[|| = sup tll, Wile: Iltlle < Lllylle < U- 


Show that this is a norm on B(E, F;G), and that with this norm the 
mapping 7 is an isometry. Deduce that if G is a Banach space, then 
so is B(E, F;G). 

Suppose that E and F' are Euclidean spaces, with orthonormal bases 
(€1,.--,;€m) and (fi,..., fn) and that T € L(E, F’) is represented by 
a matrix (t;;) with respect to these bases. What matrix represents 
the adjoint T’? 

If x € lg, let R(x); = 0 and (R(x))n = Zn_1 for n > 1: R is the right 
shift on lg. What is R’? Show that R’R is the identity on lz. What is 
RR’? 
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14.3.5 If H is a Hilbert space and T € L(H), then T is unitary (in the 
complex case) or orthogonal (in the real case), if it is an isometry 
of H onto itself. Show that T is unitary (orthogonal) if and only if 
PPsSPT=lit? =7Ts), 

14.3.6 Suppose that K is a closed linear subspace of a Hilbert space H. 

(i) Show that if « € H then there is a unique point P(x) in K such 

that ||2 — P(x)|| = inf{|ly — 2]: y € kK}. 

(ii) Show that P(a) is the unique point in K with x — P(x) € K+. 

(iii) Show that the mapping z — P(x) is linear and that P is 
continuous, with ||P|| < 1. When is ||P|| less than 1? 
P is the orthogonal projection of H onto K. 

(iv) Show that P = P* (P = P’) and that H= Ko K-. 
[Compare this with the construction in Proposition 11.4.3, when 
H is finite-dimensional.] 

14.3.7 Suppose that ¢ is a non-zero continuous linear functional on a Hilbert 
space H, and that ¢(a9) = 1. Let N be the null-space of ¢, and let 
P be the orthogonal projection of H onto N. Let zo = xo — P(20). 
Show that $(zo) = 1 and that ¢(x) = (x, z) / ||zoll”, for x € H. 

(This gives another proof of the Fréchet-Riesz representation 
theorem.) 


14.4 *Tietze’s extension theorem* 


(This section can be omitted on a first reading.) 


As an application of the results of the two previous sections, we prove 
Tietze’s extension theorem. We need a preliminary result, of interest in its 
own right. 


Theorem 14.4.1 Suppose that (F,||.||,) is a Banach space and that 
(E2,||-||2) 1s a normed space; let their closed unit balls be By and Bo, respec- 
tively. Suppose that T € L(F\, E2) and that there exist0<<t <1 ande>0 
such that «Bg C (1 —t)T(Bi) + teBe - that is, if y € eB there exist x € By 
and z € €By such that y = (1 —t)T (x) + tz. Then the following hold: 

(i) €Bo C T( By): 

(ii) T is surjective; 

(itt) If U is open in Ey then T(U) is open in Eo; 

(iv) (E2,||.||) 1s a Banach space. 


Proof (i) Suppose that zo € «Bg. Then there exist x; € By and 2 € «Bo 
such that zo = (1—t)T(x1)+tz1; iterating this, there exist sequences (x7,)?°_, 
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in By and (2,)P2, in €Bo such that z, = (1 —t)T (@n41) + t2n4i, forn EN. 
Thus 


o=-97e) +0 -)ren 4i Tee ePa 
= (1 _ tT (ey + t%g +--+ Unit) + rT ite 


Since 
[oe [o-e) (oe) 
Vie ellas Oo lle S10 =9, 
n=1 n=1 n=1 


yi, t’- lan converges absolutely to an element x of £), and 
|x|] < 1/(1 — #). Since t’+4z,41 > 0 as n > ov, it follows that z = 
(1 —t)T(x) = T((1 — t)z). Since ||(1 — t)a|| = (1 — £) |x|] < 1, 29 € T(Bi). 
(ii) T(F1) = iM Wiel Tore = Leer (By) ») Upc neBg = Ep. 
(iii) Suppose that y = T(x) € T(U). There exists 6 > 0 such that M5(x) = 
x +0B, C U. We show that y + «Bz C T(U), so that T(U) is open. If 
z=yt+w€ey+edBzy then there exists v € 6B, such that w = T(v). Thus 


2=7T (a) + To) =Teto) eT@+68)) CTU). 


Hence y + 66By C T(U). 

(iv) In order to show that (£2, ||.||,) is a Banach space, we use Propo- 
sition 14.2.5. By homogeneity, if y © E> there exists « € EF, with 
Z|, < |lyllo /e for which T(a) = y. Suppose that (yp)?2) is a sequence 
in Ey with 7°, |\ynllzp < co. For each n € N there exists x, € EF, with 
lla < [walle fe such that T(e,) = yq. Thus 5-7, ||tall, < co. Since 
(E;, |].||,) is a Banach space, }°°°., 2, converges in Ej, to s, say. Since T 
is continuous, 77°, Yn = 072 T (en) = T(s). Thus (£9, ||, ||) is a Banach 
space, by Proposition 14.2.5. 


Corollary 14.4.2 Suppose that (F,||.||,) is @ Banach space, that 
(£4, ||.||,) is a normed space and that T € L(E\, E2). Suppose that there 
exists 7 > 0 such that T(By) D> nBo. If U is open in Ey then T(U) is open 
im Eo. 


Proof Since T(B,) Cc T(Bi) + (7/2) Bo, it follows that nBz C T(B,) + 
(7/2) Bo. Set € = n/2 and t = 1/2; then eB, C (1 —t)T(B,) + teBg, and the 
result follows from the theorem. 


Theorem 14.4.3 (Tietze’s extension theorem) Suppose that f is a 
bounded continuous real-valued function on a closed subset A of a normal 
topological space (X,7). Let 


M =sup{f(a):a€¢ A}, m=inf{f(a): ae A}. 
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Then there exists a continuous real-valued function g on X such that g(a) = 
f(a) forae A, andm< g(a) <M forze xX. 


Proof ‘The result is obviously true if f is constant. Otherwise, by consid- 
ering f — (M + m)/2, we can suppose that m = —M, and by considering 
f/\|f\|, we can suppose that M = 1 and m = —1. Let R be the restriction 
mapping from C)(X) — Cy(A); R(f) = fia. We show that R satisfies the 
conditions of Theorem 14.4.1, with t = 2/3 and « = 1. Let 


B={a€éA: f(a) > 1/3} and C={ae€ A: f(a) < -1/3}. 


Then B and C are disjoint closed subsets of X, and so by Urysohn’s lemma 
there exists g € Cy(X) with ||g||,, < 1/3 such that g(b) = 1/3 for b € B and 
g(c) = —1/3 for ce C. Then 


0< f(x) -—g(x) < 2/3, forxe B, 
0> f(x) — g(x) > -2/3, forx EC, 
If(x) — g(a)| < |f(@)| + |9@)| < 2/3 for re A\ (BUC). 

Thus ||f — R(g)|loo < 2/3. Let us set p = 3g and q = (3/2)(f — R(g)). 


Then |[p||,, < 1, flail, < 1 and f = (1/3)R(p) + (2/3)q. Consequently, if 
f € C,(A) there exists g € C)(X) with R(g) = f and |lg||,, = lf ll, 


We can drop the requirement that f is bounded. 


Corollary 14.4.4 (i) Ifm < f(x) < M there exists a continuous function 
g on X such that R(g) = f and such thatm < g(x) < M forre X. 

(ii) If F is a continuous function on A then there exists a continuous 
function G on X such that G(a) = F(a) foraeé A. 


Proof (i) Again, we can suppose that M = —m = 1. There exists h € 
C,(X) such that R(h) = f and |[Al|,, < 1. Let D= {x € X: |h(x)| = 1}. 
Then D is a closed subset of X disjoint from A. By Urysohn’s Lemma, there 
exists k € C)(X) with 0 < k < 1 for which k(a) = 1 for a € A and k(d) =0 
for d€ D. Then g = h.k has the required properties. 

(ii) Let f = tan-! oF, Then f € C;(A) and —1/2 < f(a) < 1/2 fora€ A. 
By (i), there exists g € C)(X) with R(g) = f and —7/2 < g(x) < 1/2 for 
xe xX. Let G=tanog. 


14.5 The completion of metric and normed spaces 


Starting with the field Q of rational numbers, we constructed the field R of 
real numbers. This fills up the gaps in the rationals — any Cauchy sequence 
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converges — but does so in an efficient way, since any real number is the limit 
of a sequence of rational numbers. 

We can do the same for any metric space. A completion of a metric space 
(X,d) is a complete metric space (X, d), together with an isometric mapping 
j of X onto a dense subset j(X) of X. We have the following fundamental 
theorem. 


Theorem 14.5.1 Any metric space (X,d) has a completion. The com- 
pletion is essentially unique: if ((X,d),j) and ((X,d),j) are completions 
of (X,d) then there is a unique isometry k of (X,d) onto (X,d) such that 
i 
j =koj. 


Proof We give two proofs of the existence of a completion. The first is 
short, but quite artificial. We have shown in Example 11.5.12 that there is an 
isometry j of (X,d) into (B(X),||.||,,), and have shown that (B(X), ||.||,,) 
is a Banach space. We therefore take X to be the closure j(X) of j(X) 
in B(X), and take d to be the subspace metric. Then (X,d) is complete 
(Proposition 14.1.3) and j(X) is dense in (X,d). 


The second proof is longer but more natural, and is useful when we con- 


lo) 
n=1 


converge to a unique element of the completion, so that (a,)°°., determines 


sider normed spaces. If (a,)°2., is a Cauchy sequence in X then j(a,) must 
an element of the completion. In general, however, there are many Cauchy 
sequences which determine this element. We therefore define the elements 
of the completion of (X,d) to be equivalence classes of Cauchy sequences 
in X. 

Let Y be the set of all Cauchy sequences in (X,d). Suppose that a = 
(Gn )°2, and b = (b,)°2, are in Y. If € > 0 then there exists no € N such 
that d(@m,a@n) < €/2 and d(bm,bn) < €/2 for m,n > no. It follows from 
Proposition 13.3.2 that 


|\d(@m, bm) — (an, bp)| < d(am,@n) + d(bm, bn) < €, for m,n > no. 


Thus (d(an, bn))°21 is a Cauchy sequence of real numbers, which converges, 
by the general principle of convergence, to a limit p(a,b). Clearly p(a,b) = 
p(b, a), and 
poo) = lim O(Bing Gy) 
< lim d(an, On) + lim d(bn, Cr) = p(a, b) + p(b, c), 


so that p is a pseudometric on Y. We now apply Proposition 11.1.13: there 
exists an equivalence relation ~ on Y and a metric d on the quotient space 
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Y/ ~ (which we denote by X) such that the quotient mapping q : Y > X 
satisfies d(q(a), q(b)) = p(a,b), for a,b EY. 

Next we define the mapping j : X —> X.Ifre X, let x, = x for alln € N; 
the constant sequence c(z) = (#p)°°, is certainly a Cauchy sequence. We 
set j(x) = q(c(x)). If x, x’ € X then 


p(e(x),e(2!)) = lim d(an, yn) = d(x, 9), 

so that d(j(x), j(y)) = d(x,y), and so j is an isometry of (X,d) into (X,d). 

We now show that j(X) is dense in (X,d). Suppose that #¢X 
and that ¢=q(a) where a=(a,)°2,¢€Y. If NEN then p(a,c(ay)) = 
limn—+oo d(an,an) + 0 as N > oo. Thus d(#,j(ay)) 3 0 as N > «~, 
so that j(X) is dense in (X,d). 

The metric space (X,d), together with the isometry j, will be the 
completion of (X,d). 

We now come to the crux of the proof, and show that (X,d) is complete. 
Suppose that (#*))9°., is a Cauchy sequence in (X,d). Since j(X) is dense 
in (X,d), there exist a, € X with d(#™), j(a,)) <1/k, for k © N. Then 


di (we), j(@)) — d(@,@)| < 1/k + 1/L, 


so that (j(x,))%, is a Cauchy sequence in (X,d). Since j is an isom- 
etry, (vp )P2, is a Cauchy sequence in (X,d). Let @=q((rz)7,). Then 
d(j(1),@) = limg-so0 d(aj,2%) and so d(j(aj),@) 4 0 as 1 > oo, since 
(xp)p21 is a Cauchy sequence. Thus j(x;) + & as | — oo. Consequently 


d(@ ,#) < d(é, j(w1)) + dG (ai), #) < 1/1 + d(G(a), #) > 0 


as 1 oo. Thus %; > % as 1 > oo. 

Finally, we show that the completion is essentially unique. The mapping 
jog> ‘nu 
By Theorem 14.1.10, there is a unique continuous extension k : (X,d) > 


is an isometry from j(X) into (X,d), and so is uniformly continuous. 


(X,d), and k is an isometry. k(X) is therefore complete, and so is closed 
in X. But j’(X) C k(X), and j’(X) is dense in X, and so k(X) = X: k is 
surjective. 


Because of the essential uniqueness of completion, we usually talk about 
the completion of a metric space, and consider X as a dense subset of its 
completion (just as we consider the field Q of rational numbers as a subfield 
of the field R of real numbers). 

There is a corresponding result for normed spaces. 
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Theorem 14.5.2 Suppose that (E,|].||) 7s a normed space. There exists a 
Banach space (E,||.||°), and an isometric linear mapping j : E + E such 
that j(E) is a dense linear subspace of E. (E,|\.||°) is the completion of E. 
It is essentially unique: if ((E,||.||), 7’) is another completion then there is 
a unique linear isometry k of (E,||.||°) onto (E,||.||-) such that j! = koj. 


Proof | Consider the second construction of Theorem 14.5.1, using Cauchy 
sequences. The space Y of Cauchy sequences has a natural vector space 
structure: define 


(Gn)per + (bn) per = (an + bn)po,  A(Gn)Ry = (Aan) 1 


verifying that the sum and scalar product are in Y. The pseudometric p 
is given by the seminorm 7, where z(a) = p(a,0) = limp+.|lan||. The 
equivalence class q(0) to which 0 belongs is 


N = {(an)72]1 : Gn 7 0 as n > oo}, 


which is a linear subspace of Y. Further, g(a) = a+ N, so that E is the 
quotient vector space E/N, and j is a linear mapping of EF into E. If we 
set ||@||° = d(#,0), then |j.||* is a norm on E which defines d, and under 
which E is a Banach space. The facts that j is an isometry and that 4(E) is 
dense on E come from Theorem 14.5.1, as does the existence of an isometry 
k : E + E. It remains to show that k is linear. If ZY E E, there exist 
sequences (,p,)°°, and (y,)°@, in & such that j(t,) > £ and j(yn) > ¥ 
as n — oo. Then j(@p + yn) ~ £+ 9%, and so, using the continuity of 7, 7’ 
and k, 


oa lim (ian) + 7'(Yn)) a jim "(an + Yn)) 


= k( lim (j(n + Yn))) = WE + 9). 


Scalar multiplication is treated in a similar way. 


Again, we consider (£,||.||) as a dense linear subspace of its completion 


(E, [I°). 


Exercise 


14.5.1 Define the notion of a convergent sequence in Q and a Cauchy 
sequence in Q, using rational numbers, rather than real numbers. 
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Show how the second proof of Theorem 14.5.1 can be used to con- 
struct the completion Q of Q. Show that Q is an ordered field, and 
that every non-empty subset of Q which is bounded above has a least 
upper bound. 

[This is another way of constructing R from Q.] 


14.6 The contraction mapping theorem 


If f isa mapping of a set X into itself, then an element x of X is a fixed point 
of f if f(z) = x. As we shall see, fixed points frequently have interesting 
properties. 

A mapping f : (X,d) > (X,d) of a metric space into itself is a contraction 
mapping of (X,d) if there exists 0 < K < 1 such that d(f(x), f(y)) < 
Kd(a,y) for all x,y € X; that is, f is a Lipschitz mapping with constant 
strictly less than 1. The fact that the constant K is strictly less than 1 is of 


fundamental importance. 


Theorem 14.6.1 (The contraction mapping theorem) [If f is a contraction 
mapping of a non-empty complete metric space (X,d) then f has a unique 
fixed point Xoo. 


Proof Let K be the Lipschitz constant of f. Let 29 be any point of 
X. Define the sequence (x,,)?2.9 recursively by setting tp+1 = f(x,). Thus 
i, =f" (eq), and 


dis tec} < KGa aita) < K?d(an—2,£n—1) Sees K"d(xo, 21). 


We show that (a%,)?°_9 is a Cauchy sequence. Suppose that € > 0. There exists 
no€N such that A” <(1—K)e/(d(xo, 21) +1) for n>no. If n>m>nNno 
then 
Gl teins Te) < i irs ane D, ale Dy its Tg) aii bl Gi, Tn) 
< K™d(xo, 21) + K™*"d(ao, £1) + +--+ K"-'d(xo, 21) 
< K™d(xo,21)/(1- K) <e. 


Since (X,d) is complete, there exists 7. € X such that rn > oo as N—> OO. 
Since f is continuous, tn41 = f(@n) > f (@oo) as N > 00, and 80 Log = f (Loo): 
Loo is a fixed point of f. If y is any fixed point of f then d(y,r.) = 
a f(y), J Geo)) < Kdly; too); Wence dly, to.) = 0, and y = Gee} Bec is the 
unique fixed point of f. 
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Three points are worth making about this proof. First, we start with any 
point x9 of X, and obtain a sequence which converges to the unique fixed 
point 20; further, d(xo, 20) < d(xo,21)/(1 — K). Secondly, d(an41, 20) = 
dif (aia), f (sa) & Kaley, Ton), 80 that dltg, to5) < Kd eG, ton J; the con- 
vergence is exponentially fast. Thirdly, the condition that d(f(zx), f(y)) < 
d(x,y) for « # y is not sufficient for f to have a fixed point. The func- 
tion f(z) = «+e : [0,c0) > [0,00) does not have a fixed point, 
but satisfies the condition; if 0 < y < x < oo then, by the mean-value 
theorem, f(x) — f(y) = (1 — e-°)(a% — y) for some x < c < y, so that 
If(x) — Fy) < |z—- yl. 


Corollary 14.6.2 Suppose that g is a mapping from X to X which 
commutes with f: fog=gof. Then Xo is a fixed point of g. 


Proof f(9(%0)) = 9(f(oo)) = g(%oo); G(Too) is a fixed point of f, and so 
Gl Fee) = Xoo- 


We can strengthen the contraction mapping in the following way. 


Corollary 14.6.3 Suppose that h : (X,d) > (Xqa) is a mapping of a com- 
plete metric space into itself, and suppose that h* is a contraction mapping 
for somek EN. Then h has a unique fixed point. 


Proof Let f = h®. Then f has a unique fixed point 7. As foh=hof = 
f'*1, xoo is a fixed point of h. If y is a fixed point of h then f(y) = h®(y) = y, 
so that y = 209; Yoo is the unique fixed point of f. 


Suppose that (X,d) and (Y,p) are metric spaces and that f:X x Y ~ Y 
is continuous. Can we solve the equation y = f(x,y) for each x € X? In 
other words, is there a function ¢: X — Y such that $(x) = f(x, ¢(x)) for 
each « € X? If so, is it unique? Is it continuous? 

Our first application of the contraction mapping theorem gives sufficient 
conditions for these questions to have a positive answer. It can be thought 
of as a contraction mapping theorem with a continuous parameter. 


Theorem 14.6.4 (The Lipschitz implicit function theorem) Suppose 
that (X,d) is a metric space, that (Y,p) is a complete metric space 
and that f:X xY—-Y is continuous. If there exists 0< K <1 such that 
pP(f (x,y), f(x, y')) < Kply,y’) for alla € X and y,y' € Y then there exists 
a unique mapping ¢: X > Y such that (x) = f(x, ¢(x)) for each x € X. 
Further, @ is continuous. 


Proof The proof of existence and uniqueness follows easily from the con- 
traction mapping theorem. If « € X then the mapping f, : Y — Y defined 
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by fr(y) = f(x,y) is a contraction mapping, which has a unique fixed point 
(x). Then f(x, 6(x)) = fe(9(@)) = (2). 


It remains to show that @ is continuous. Suppose that « € X and that 
€ > 0. There exists 6 > 0 such that if d(x, z) < 6 then 


If d(x, z) < 6, then 


P(9(@), O(2)) S (G@), Fz, O(@)) + p(T (2, O(@)), F(z, (2) 
< (1— K)e+ Kp($(2), (2), 


so that p(¢(x), d(z)) <e. 


Our next application, which uses Corollary 14.6.3, gives a proof of 


the existence and uniqueness of solutions of certain ordinary differential 
equations. 


Theorem 14.6.5 Suppose that M > 0 and that L > 0. Suppose that H is 
a continuous real-valued function on the triangle 


T ={(2,y) € R?:0<2<8,ly| < Mo}, 


that |H(x,y)| <M and that |H(e,y) — H(x,y!)| < Ly —y!l, for (x,y) € 
T and (x,y’) € T. Then there exists a unique continuously differentiable 
function f on [0,b] such that f(0) =0, (x, f(x)) € T for x € [0,6] and 


af 


an (x) = H(a, f(x)) for all x € (0, d]. 


Proof If f is any solution, then 
If(x)| = |f(@) — F()| = |f A(t, f(t))dt| < | | (t, f(t))|dt < Ma, 
0 0 


for 0 < a < b, so that the graph of f is contained in T’. The second condition 
is a Lipschitz condition, which is needed to enable us to use the contraction 
mapping theorem. 

First let us observe that the fundamental theorem of calculus shows that 
solving this differential equation is equivalent to solving an integral equation. 
If f is a solution, then, as above, 


f(x) = f(x) — f(0) = [re dt = [ ne.seyae for all x € [0,0]. 
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Conversely, if f is a continuous ae ea oe i integral equation, 
then f(0) = 0 and the function J(f =f, )) dt is differentiable, 
with continuous derivative H(z, Me ~~ Thus oe cay = ee )) for x € 
[0, b]. 

Let X = {g € C[0,}] : |g(x)| < Ma for x € [0,b]}. X is a closed subset 
of the Banach space (C0, }], ||.||,,), and so is a complete metric space under 
the metric defined by the norm. We define a mapping J : X — C[0,6] by 
setting 


J(g)(z) = A H(t, g(t)) dt, for x € [0, b]. 


Then J(g) is a continuous function on [0,b] and 
ial < f° Mat = Me. 
0 
so that J(g) € X. We now show by induction that, for each n € ZT, 
Lx” 
|J"(g)(@) — J"(h)(@)| < ——|l9 — Allo» for g,h Ee X andO<ax<b. 
n! 
The result is certainly true for n = 0. Suppose that it is true for n. Then 
|r g(a) — IPT*(h) (a) | < | [AT(t, J"(9)(t)) — H(t, J" (h)(t))| dt 
< [rma — manele 


x prti 
< | —— |g — hla. t” at 


Lrtigntl 
Sag lls 
(n+ 1)! 
Thus 
a " Ee i 
7") — F™(AY loo S M19 = Pleo 


Now L"b"/n! — 0 as n — ov, and so there exists k € N such that 
Lkok /k! <1. Thus J* is a contraction mapping of X. We apply Corollary 
14.6.3. J has a unique fixed point f, and f is the unique solution of the 
integral equation. 


The next application of the contraction mapping theorem is an inverse 
function theorem. 
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Theorem 14.6.6 (The Lipschitz inverse function theorem) Suppose that 
U is an open subset of a Banach space (E,||.||) and that g: U > E is a 
Lipschitz mapping with constant K < 1. Let f(x) = x+9(2). If the closed 
neighbourhood M,(x) of x is contained in U then 


Ma-Kye(f(#)) © F(Me(a)) © Mareye( f(x). 


The mapping f is a homeomorphism of U onto f(U), f~! is a Lipschitz 
mapping with constant 1/(1— kK), and f(U) is an open subset of E. 


Proof Since x — y = (f(x) — fly)) — (g(x) — g(y)) 
lz — wll < F(z) — FIL + Ilg@) — o@)I S Ife) — Fy) + K lla — yl, 


so that || f(x) — f(y)|| > (1— K) ||z—y|. Thus f is one-one, and f~! isa 
Lipschitz mapping with constant 1/(1 — K). 

Suppose that x € U and that the closed neighbourhood M,(z) is contained 
in U. Then 


f(x) — FYI < Ile — yl] + [lg@) — gM) S A+ 4) |e — yl, 


so that f(M.()) C Ma+x)e(f(2)). 

Suppose that y € Mq_x)e(f(x)). Let h(z) = y— g(z), for z € M.(x). We 
shall show that h is a contraction mapping of M,(x). First, if z € M-(x) 
then 


[h(2) — ell = ly — 2 — 92) = lly — Fe) + 9(@) — a2) 
< |ly— fe) + llg(@) -— 9()|| < - K)e+ Ke =e, 
so that h(M-.(a)) C M.-(x). Secondly, 
|[h(z) — h(w)|| = Ilg(z) — g@)|| < K |]z— wll, 


so that Ah is a contraction mapping of M,(x). Since M,(zx) is closed, it is 
complete, and so h has a unique fixed point v. Then v = y — g(v), so that 
y = fv) € f(M-(z)). Thus Mqa_x)e(f(@)) © f(Me(x)). Consequently, f(U) 
is an open subset of E. 


We shall use this theorem later to prove a differentiable inverse function 
theorem (Theorem 17.4.1). 

We can apply this result to linear operators on (£,||.||). In this case, 
however, it is more natural to proceed directly. Suppose that $,T € L(E). 
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Then the composed map So T € L(£). Further, 


[So T]| = supt{||S(7(2))| = [lel] < 1 


[S|] sup{|/7(@) |] = [lal] < 1} = IS] (Z1. 


IA 


We set T° = I. Then ||7|| < ||T'||" for all n € Z*. We use this inequality 
to prove the following. 


Theorem 14.6.7 Suppose that (E,||.||) is a Banach space, that T € L(E) 
and that \|T* || <1 for some k € N. Then \>°°.,T" converges absolutely 
in L(E). If S = 57°) 7” then (I -T)S = SUI -T) =I, so that I-T is 
invertible, with inverse S. 


Proof Since ||T"*|| < ||T*||" for n € N, 77°, ||T"*|| < oo. Thus 


love) k-1 ow k-1 oc 
dirt = o> eee" < Oe eel fe 
n=0 j=0 n=0 j=0 n=0 


) <x 


and so }>°° ) 7” converges absolutely in A, to S, say. In particular, T” — 0 
as n — oo. Let Sp = 77-9 T). Then 


k-1 lee) 
(SII (s.r 
j=0 n=0 


T=2)5,=5,0=Pyslot i eye, 


But (I —T)S, ~ (I —T)S and S,(1 —T) > S(I —T) as n > ov, and so 
Cem oe tem eee 


Corollary 14.6.8 Jf ||T|| <I then I —T is invertible, and 
(2 -—7)4I] < 1/0 = IIT. 


Proof Wecan take k = 1. Then 


(oe) (oe) 1 
Iss SOIT" s So TI" = T=] 71 
n=0 n=0 


The series }>°° ) 7” is called the Neumann series. 
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Corollary 14.6.9 Let GL(E) be the set of invertible elements of L(E). 
Then GL(E) is an open subset of L(E), and the mapping S > S7! is a 
homeomorphism of GL(E) onto itself. 


The set GL(E) is a group under composition. It is called the general linear 
group. 


Proof Suppose that SEGL(E). Let a= S27". Suppose that 
\|U || <a/2. Then ||US™'|| < ||U||/a < §, so that [+ US™! is invertible 
and ||(I+US~!)"|| < 2. Then $+U = (I +US~1)S is invertible, with 
inverse S~!(1+US—')—!, so that GL(E) is an open subset of L(£). Further, 
|S +U)|| < |[s-4] .||@+Us-1)-}]] < 2/a. Now ($+ U)-1- $-1 = 
—($+U)—1US“!, so that 


(S$ + U)* — S| < 20] /o?, 


and (S+U)~! + $~! as U > 0. Thus the mapping S — $7! is continuous 
on GL(E). Since (S~1)~! = S, it follows that the mapping S > S~! isa 
homeomorphism of GL(£) onto itself. 


Corollary 14.6.10 Suppose that E and F are Euclidean spaces and that 
1<k<d=dimF. The set L,(E, F) of linear mappings in L(E, F) of rank 
greater than or equal to k is an open subset of L(E, F). In particular, the set 
La(E, F) of surjective mappings in L(E, F) is an open subset of L(E, F). 


Proof Suppose that T € L,(E,F). Let N be the null-space of T, let 
E, = N+, and let j : E, > E be the inclusion mapping. Let F, = T(E) = 
T(E) and let P: F > F; be the orthogonal projection of F onto F;. Then 
T, = PoT oj isa linear isomorphism of E; onto F;. Let 6 = 1/ |Z, *|]. If 
$1 € L(E\, F)) and ||S; — T;|| < 6 then ||T7' oS; — I|| < 1,so that T7'S € 
GL(E,). Hence Sj is a linear isomorphism of FE onto F). If S€ L(E, F’) and 
|S —T|| < 6 then ||(PoSoj)—T|| < 6, so that Po Sof is a linear 
isomorphism of FE, onto F,, and therefore has rank k. Thus rank(S) > 
rank(PoSoj)>k. 


Let us apply Theorem 14.6.7 to some linear integral equations. First sup- 
pose that K is a bounded uniformly continuous real-valued function on the 
square [a,b] x [a,b], that g € C([a,b]) and that » is a parameter in R. We 
seek a solution f € C([a,b]) to the Fredholm integral equation 


b 
(HO +0 7 K(x,y) f(y) dy for x € [a,). 
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K is the kernel of the equation. AK defines an element of L(C([a,b])): if 
f € C([(a, b)) let 


n= [Kew y) dy for x € [a,b]. 


First we show that Tx(f) € C({a,6]). Suppose that «€ > 0. Let 7 = 
e/(b—a)(||f|l,, + 1). There exists 6 > 0 such that 


|K(a,y) — K(a',y')| < n if |x —2"| <6 and ly —y'| < 6. 


If |x — 2’| < 6 then 


b 
ITk (f)(@) — Tr(f)(@‘)| 2) |K(2,y) — K(2’, IF) dy 
<n Ilflloo (b- a) <e. 


Thus Tx (f) is continuous on [a, 6]. 
Further 


(NU) < f IK AFD Ay <O~ A) IK Flo 
where ||A||,, = sup{|K (2, y)| : (zy) € [a,] x [a,b]}. Consequently Tk € 
L(C(({a, b])), and ||Tx|| < (6— a) ||K]|,,. Thus if |A|(6 — a) ||K||,, < 1 then 
||ATK || < 1, and so the continuous linear operator J — AT; is invertible. 
Thus for each g € C({a,}]) there exists a unique f € C([a,b]) such that 
(I — ATK) f = g; the Fredholm integral equation has a unique solution if 


IA} [Tx || < 1. 
Next, we consider the Volterra integral equation 


f(x) te) +d [Kaus y) dy for x € [a, B, 


where K is a bounded uniformly continuous function on the triangle T’ = 
{(z,y):a<y<a <b}. If f € C((a,d)) let 


n= f Kes y) dy for x € [a,b]. 
Again, Vx(f) € C((a, 6]). We claim that 


IVR) (@) < FM" ACI Illy for # € [a,b] and ne Z*, 
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where ||A'||,, = sup{|K (az, y)| : (z,y) € T}. We prove this by induction. The 
result is certainly true when n = 0. Suppose that it holds for n. Then 


Ve (F)(@)| = 


[ kenvainw) a < [1K (e.vR ley 


; Eg fy oy 
<WMle [WR wIay s A Wl fw —a)" dy 


= FA VAIS 


In particular, ||Vz|| < (6 — a)” ||K||5, /n!, and so |A|” ||Vz|| > 0 as n > oo. 
Thus |A|” ||V2|| < 1 for large enough n, and so 1 — AVx is invertible for all 
» © R; the Fredholm integral equation has a unique solution, for all A € R. 
A concluding remark: as we shall see in the next chapter, it is enough to 
assume that the kernels in the Fredholm and Volterra equations are contin- 
uous, since this implies that they are bounded and uniformly continuous. 


Exercises 


14.6.1 Give an example of a surjective contraction mapping f on an 
incomplete metric space (X,d) with no fixed point. 

14.6.2 Suppose that f,g are contractions of a complete metric space (X, d). 
Show that there exists unique points 79 and yo in X such that rp = 
g(yo) and yo = f (zo). 

14.6.3 ae that h € C([a,b]). If f € C({a,d]), let In(f) = 
o f(x)h(«x) dx. Show that if C([a, b]) is given the uniform norm then 
Inisa paces linear functional, and ||Ip,||' = is |h(a)| dx. [Consider 
approximating sums to the integral.] 

14.6.4 Suppose that K is the kernel of a Fredholm operator on C({a, }]). 
Show that ||Tx|| = sup{ | (x, y)| dy : x € [a, bj}. 

14.6.5 Verify that if f € C({a, b]) then V«(f) is continuous. 


14.7 *Baire’s category theorem* 
(This section can be omitted on a first reading.) 


We now prove Baire’s category theorem, which is a straightforward 
extension of Osgood’s theorem to complete metric spaces. 


Theorem 14.7.1 (Baire’s category theorem) Jf (U,,) is a sequence of dense 
open subsets of a complete metric space (X,d) then N°°,U,, is dense in X. 
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Proof Suppose that V is a non-empty open subset of X. We must show 
that VM (n°2,U,) is not empty. Since Uj is dense in X, there exists cy € 
V NU,. Since V NU, is open, there exists 0 < €; < 1/2 such that 


Ne, (ei) C Mz, (c1) CVNU,. 
We now iterate the argument; for each n € N there exist 
Gr = Ne (ai) iG, and 0 = 4, < 1/2” 


such that 
Nz, (Cn) c M.,, (Cn) c Ne. (ni) NU pn. 


The sequence (N¢,(cn))°2, is decreasing, so that if m,p > n then 


n 


Cm © Mz,,(cn) and cp € Me, (cn), so that 
(Crago) S dl Gray Ga) dG, Op) < 2/2" 


thus (c,)°2, is a Cauchy sequence in (X,d). Since (X,d) is complete, it 
converges to an element c of X. Suppose that n € N. Since cm € Mz, (cn) 
for m > n and since M,, (cn) is closed, c € Mz, (cn) C Un. Thus c € NPL. 
Further, c € M.,(c1) CV, and soce V. 


Note that the proof uses the axiom of dependent choice. This cannot be 
avoided: if Baire’s category theorem is true for all complete metric spaces 
then the axiom of dependent choice must hold (Exercise 14.7.4). On the 
other hand, the theorem can be proved for separable complete metric spaces 
without using the axiom of dependent choice (Exercise 14.7.5). 

The following corollary is particularly useful. 


Corollary 14.7.2 Suppose that (C,)°, is a sequence of closed subsets of 
a complete metric space (X,d) whose union is X. Then there exists n such 
that C, has a non-empty interior. 


Proof Let U, = X \ Cy. Then (U;,)°2, is a sequence of open sets and 
nec, U, is empty, and so is certainly not dense in X. Thus there exists Up, 


which is not dense in X; that is C, has a non-empty interior. 


It is sometimes useful to have a local version of this corollary. This depends 
upon the important observation that the hypotheses and conclusions of the 
theorem are topological ones, so that Baire’s category theorem applies to 
topologically complete metric spaces; in particular, by Theorem 14.1.13, it 
applies to open subsets of complete metric spaces, and to Gs subsets of 
complete metric spaces. 
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Corollary 14.7.3 Suppose that (C,)°2, is a sequence of closed subsets of 
a complete metric space (X,d) whose union contains a non-empty open set 
W. Then there exists n such that C, 1 W has a non-empty interior. 


Proof The sets C,W are closed subsets of W whose union is W, and so 
there exists n and a non-empty open subset V of W such that V CC,9W. 
Since W is open in X, it follows that V is open in X. 


Baire proved his theorem (for R”) independently of Osgood. It was 
included in his doctoral thesis, published in 1899. Why is the word ‘cat- 
egory’ used? This is a matter of terminology. A subset A of a topological 
space is said to be nowhere dense if its closure has an empty interior. It is 
said to be of the first category in X if it is the union of a sequence of nowhere 
dense sets, and is said to be of the second category in X if it is not of the first 
category in X. Thus Corollary 14.7.2 states that a complete metric space is 
of the second category in itself. 

Let us now turn to some applications of the theorem. 


Proposition 14.7.4 Suppose that F is a set of continuous mappings from 
a complete metric space (X,d) into a metric space (Y,p), with the prop- 
erty that F(x) = {f(x): f € F} ts bounded, for each x € X. Then there 
exists a non-empty open subset U of X and a positive number K such that 
diam (F'(x)) < K for each x €U. 


Proof If f,g € F then the function « + p(f(x), g(x)) is continuous on X 
and so the set {x € X : p( f(x), 9(x)) < n} is closed. Consequently the set 


C, = {x € X : diam (F(z)) < n} 
=Npger{e € X : p(f(x), g(@)) <n} 


is closed. By hypothesis, X = UP2,C;, and so there exists n such that C,, 
has a non-empty interior. 


The corresponding result for continuous linear mappings is more useful. 


Theorem 14.7.5 (The principle of uniform boundedness) Suppose that A 
is a set of continuous linear mappings from a Banach space (E,_||.||,,) into 
a normed space (F,_||.||,~) with the property that A(x) = {T (x): T € A} is 
bounded, for each x € E. Then sup{||T|| : T € A} is finite. 


Proof Let C, = {x € E: sup{||T(z)|| : T € A} < n}. Then (C;,,)e<, is 
a sequence of closed sets whose union is £, and so there exists n such that 
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C;, has a non-empty interior. Thus there exists 79 € E and € > 0 such that 
M.(zo) C Cy. Let K = 1/e. If T € A and ||z|| p> < 1 then 


T@)lle = K ||T(e@) Ip = K ||P (wo + ex) — T(x) le 
< K(\|T(20 + €#)[lp + IIT (@o) Ile) < 2K, 


so that ||T'| < 2k n. 


The contrapositive is equally useful. 


Theorem 14.7.6 (The principle of condensation of singularities) Suppose 
that D is an unbounded set of continuous linear mappings from a Banach 
space (FE, ||.||,~) into a normed space (F,_||.||,-). If « € E, let D(x) = {T (a) : 
T € D}.Then H = {x € E:: D(x) is unbounded} is of the second category 
in E. 


Proof For each n EN, the set G, = {a € E: suprep ||T(z)|| < n} isa 
closed nowhere dense subset of FE, so that UncenG rn is of the first category in 


E. By Baire’s category theorem, H cannot be of the first category in FE. 


The principle of uniform boundedness has the following consequence. 


Theorem 14.7.7 (The Banach-Steinhaus theorem) Suppose that (T,)°-, 
is a sequence of continuous linear mappings from a Banach space (E, |\.|| 7) 
into a normed space (F,||.||;-), and that T,(a) converges, to T(x), say, as 
n — oo, for each « € E. Then T is a continuous linear mapping from E 
into F’. 


Proof The mapping T is certainly linear. For each x € X, the set 
{T,(x) : n € N} is bounded. By the principle of uniform boundedness, 
there exists K such that ||T,|| < AK for all n € N. If « € FE then 
|Z(@) ||» = limn—soo ||Tn (x) || p> < K ||2||_~, So that T is continuous. 


(Terminology varies; many authors call the principle of uniform bound- 
edness the Banach-Steinhaus theorem.) 

We now combine the Baire category theorem with Corollary 14.4.2 to 
prove some of the most powerful results of Banach space theory. 


Theorem 14.7.8 (The open mapping theorem) Suppose that T is a sur- 
jective continuous linear mapping of a Banach space (E,||.||,~) onto a Banach 
space (F,||.||-). If U is open in E then T(U) is open in F. 


Proof Let Bg be the unit ball in E, Br the unit ball in F’. Let A, = 
T(nBg), for n € N. Then A, = nAj, Aj is convex (Corollary 11.2.2) and 
A, = —A, . Now F = T(U% nBr) = US T(nBr) © U%, An so that 


424 Completeness 


F =U, An. By Baire’s category theorem there exists n so that A, has a 
non-empty interior. Since the mapping y > y/n is a homeomorphism of F, 
A, has a non-empty interior. Thus there exist yo € A, and € > 0 such that 
M-(yo) © At. If |lyl| jp < € then yo+y € Ay and yo—y € Ay. Since Ay = —Aj, 
—yo + y € Aj, and since Aj is convex y = 5((Yo +y)+(—yo + y)) € At, so 


that A; = T(Bg) > Br. The result now follows from Corollary 14.4.2. 


Corollary 14.7.9 (The isomorphism theorem) Jf T is a bijective con- 
tinuous linear mapping of a Banach space (E,_||.||,,) onto a Banach space 
(F,||.\|,-), then T+ is continuous, so that T is a homeomorphism. 


Recall that a continuous mapping from a topological space to a Tj 
topological space has a closed graph. 


Corollary 14.7.10 (The closed graph theorem) Jf T is a linear mapping 
of a Banach space (E, ||.||~) into a Banach space (F,_||.||,~) which has a closed 
graph, then T is continuous. 


Proof The graph Gr of T is a closed linear subspace of the Banach 
space (£,]|.||z) x (F,|l-|-), and so is a Banach space, under the norm 
I(x, T(2))|| = llelly + IT@)llp- If (@,T(@)) € Gr let R((x,T(2))) = T(x) 
and let L((x,T(x)))=a. R is a norm-decreasing linear mapping of G'p into 
F,, and is therefore continuous. L is a bijective norm-decreasing linear map- 
ping of the Banach space G'p onto the Banach space EF; it is continuous, and 
so L~! is continuous, by the isomorphism theorem. Thus T = Ro L~! is 


continuous. 


This theorem says the following. Suppose that T is a linear mapping of 
a Banach space (£,||.||,,) into a Banach space (F,||.||-) with the property 
that whenever (z,)°2, is a sequence in E for which x, — x and T(r) > y 
as n — oo, then T(z,,) + T(x) as n > oo. Then if (#,)°2, is a sequence in 
E for which 2, > xz as n > ov, then T(t,) — T(z) as n > oo. The gain 
may appear to be slight, but this is a powerful theorem. 

The general principle of convergence ensures that the uniform limit of 
continuous functions is continuous, but the same is not true for functions 
which are the pointwise limit of continuous functions. On the other hand, as 
we shall see, not every function on a complete metric space is the pointwise 
limit of continuous functions. Baire used his category theorem to establish 
properties of such limits. 

We shall restrict attention to real-valued functions defined on a complete 
metric space (X,d); the results extend easily to functions taking values in 
a separable metric space. Suppose that f is a function on X. Recall that 
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if A is a subset of X the the oscillation Q(f,A) of f on A is defined as 
Q(f,A) = sup{|f(e) — f(y)| : z,y € A}. If a € X and 6 > O, we set 
O5(f)(x) = Q(f, N5(x)). Then 05(f)(x) is an increasing function of 6 taking 
values in [0,00]. We set Q(f)(x) = inf{0Q5(f)(x) : 6 > O}. Then it is easy to 
see that f is continuous at x if and only if Q(f)(x) = 0. 


Proposition 14.7.11 Jf f is a real-valued function on a metric space 
(X,d) and e > 0 then the set U. = {a € X : O(f)(x) < €} ts open in (X,d). 


Proof Suppose that « € U,.. There exists 6 > 0 such that 05(f)(x) < ¢. If 
y € Nj(a), there exists 7 > 0 such that N,(y) C No(x). Then 


Q(F)(Y) S On(P)(y) < M6(F) (a) <6, 


so that y € U.. Thus N5(x) C U;, and U; is open. 


Corollary 14.7.12 Suppose that f is a real-valued function on a complete 
metric space (X,d) for which the set U. = {x € X : O(f)(x) < e} is dense 
in X, for eache > 0. Then the set C of points of continuity of f is dense in 
(X, d). 


Proof For C =M72.,U1/n, and so the result follows from Baire’s category 


theorem. 


Theorem 14.7.13 Suppose that f is the pointwise limit of a sequence 
(fn)°1 of continuous functions on a complete metric space (X,d). Then 
the set C of points of continuity of f is dense in (X,d). 


Proof | Weshow that the conditions of Corollary 14.7.12 are satisfied. Sup- 
pose that « > 0. If 7 € Z let aj = je/4 and let b; = a; + €/2, so that 
R = Ujez(a;, bj). Suppose that V is a non-empty open subset of X. Recall 
(Theorem 14.1.13) that V is topologically complete; there is a complete 
metric on V which defines the subspace topology of V. Let 


Ang ={2e V i fale) = las, 0;) for m2 n} 
= Am>n{£ eV: fm(z) € [a;, bj]}. 


Then Ay; is a closed subset of V, and V = U{An,j; :n € N,j € Z}. Note 
that if € An; then f(x) € [a;, bj]. By Baire’s category theorem, there exist 
n,j such that A,,; has a non-empty interior in V. Since V is open in X, 
An, has a non-empty interior in X. Thus there exist x ¢ V and 7 > 0 such 
that N,(z) C Anz. Then 2(f)(x) < On(f)(x) < €/2 < €, so that x Ee VNU, 
and U, is dense in (X,d). 
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In fact, we can say more. 


Proposition 14.7.14 Suppose that f is a real-valued function on a com- 
plete metric space (X,d) and that the set C of points of continuity of f is 
dense in (X,d). Then the set D of points of discontinuity of f is of the first 
category in X. 


Proof For D=U%,B,, where B, = {x € X : Q(f)(x) > 1/n}, and each 
B,, is closed and nowhere dense. 


Corollary 14.7.15 If (X,d) has no isolated points, then C is uncountable. 


Proof If not, then C is the union of countably many singleton sets, each 
of which is nowhere dense. Thus X = U?2_,B, UC is the countable union of 


closed nowhere dense sets, giving a contradiction. 


We end this section with a remarkable result of the Catalan mathemati- 
cian Ferran Sunyer y Balaguer. 


Theorem 14.7.16 Suppose that f is an infinitely differentiable function 
on (0,1) with the property that for each x € (0,1) there exists n € Zt such 
that f(x) =0. Then f is a polynomial function. 


Proof Let An = {x € (0,1) : f(a) = 0}, and let E, be the interior of 
An. Let E = US 9Fn, and let F = (0,1) \ &. Since each A, is closed and 
since if [a,b] C (0,1) then [a,b] = UP29(An / [a, b]), it follows from Baire’s 
category theorem that E is dense in (0,1). In particular, there exists n € N 
such that E, is not empty. Note that ifm > n then f(x) = 0 for x € En, 
so that E, C Em. Note also that, by continuity, f° (x) = 0 for z € Ep. 
We shall show that there exists n such that FE, = (0,1). Suppose not. If 
En #0 then E,, is the union of countably many disjoint open intervals, the 
constituent intervals of En. 

Now E is open, and is the union of countably many disjoint non-empty 
open intervals, the constituent intervals of E. Suppose that I is one of them, 
and that « € J. Then there exists a least m € Z* for which x € E,, and x 
is in a constituent interval J,,, of E,,. Then I, C I. We show that J, = I. 
If not, one of the endpoints of J,, is in J. Without loss of generality, we can 
suppose that it is a right-hand endpoint b. Since the sequence (E,)°2, is 
increasing, and b ¢ E,, there exists a least integer p > m such that b € Ep. 
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Since Im C Em © Ep-1, it follows that 6b is a right-hand endpoint of a 
constituent interval of E,-1. Consequently, f?~1(b) = 0. Since E, is open, 
there exists c > b such that (b,c) C Ep. If x € (b,c) then 


fN(e) = FONE) + [FO Wat =o, 
b 


so that (b,c) C Ep—1. Consequently b € E,—1, contradicting the minimality 
of p. Thus I, = I. Consequently, the constituent intervals of E,,+1 are either 
constituent intervals of E,,, or are intervals disjoint from E,,. 

The set F' = (0,1) \ E is a closed nowhere-dense subset of (0,1). It is also 
a perfect subset of (0,1). For if b were an isolated point of F’ there would be 
two disjoint open intervals in F with b as end-point. Thus there would exist 
(a,b) C E, and (b,c) C En, for someO< a<b<c<landm,neEN. But 
then 6 € (a,¢) © Emax(mn) C FE, giving a contradiction. 

We now apply Baire’s category theorem again, this time to the sequence 
(A, F)°<_, of closed subsets of F’. It follows from Baire’s category theorem 
that there exist n € N, « € F and 7 > 0 such that N,(x) OF C Ap. If 
y € N,(x)NF, y is not an isolated point of F’, and so there exists a sequence 
(yj)F21 in (N, (x) NF) \ {y} which converges to y. Consequently 
joo Yj = 


Thus N,(«) 1 F C An41. Iterating the argument, N,,(a) OF C UpsnAp. 

Suppose now that z € N,(xz)M E. Then z is in one of the constituent 
intervals I of E and one of its end-points, b say, is in N, 9 F’. We can 
suppose, without loss of generality, that b < z. Further, there is a least 
integer p such that J is a constituent interval of E,. Suppose, if possible, 
that p > n. Then, arguing as above, if w € J then 


fPD(w) = feV0) +f f)(t) dt = 0, 
b 
so that J is a constituent interval of E,—1, contradicting the minimality of 


p. Thus p < n, so that z € E,. Hence N,(x) C En, contradicting the fact 
that z € F. 


Corollary 14.7.17 Suppose that f is an infinitely differentiable function 
on R. with the property that for each x € R there exists n € Zt such that 
f(x) =0. Then f is a polynomial function. 
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For f is a polynomial function on each bounded open interval of 


R, and two polynomial functions which are equal on an interval must be 


defined by the same polynomial. 


14.7.1 


14.7.2 


14.7.3 


14.7.4 


14.7.5 


Exercises 


Use Baire’s category theorem to show that a perfect subset of a 

complete metric space is uncountable. 

Show that the real line is not the union of a set of proper non-trivial 

disjoint open intervals. 

Let G,, be the set of functions f in C((0,1]) for which there exists 

0 <a <1 for which | f(x) — f(y)| < n|x — y| for all y € [0,1]. Show 

that G, is a closed subset of C(([0,1]). Show that G, is nowhere 

dense. Deduce that the set of continuous functions on [0,1] which 

are nowhere differentiable is of the second category in C((0, 1}). 

This exercise shows that if Baire’s category theorem is true, then 

the axiom of dependent choice must hold. Suppose that X is a non- 

empty set, and that ¢ is a mapping from X into the set of non-empty 
subsets of X. Let X, = X for n € Zt, and let P = []?- 9 Xn. 

Give each X,, the discrete metric, and give P a uniform product 

metric d. 

(a) Show that (P,d) is a complete metric space. 

(b) Ifn € Zt, let V, ={f € P: there exists k > n with fr € O(fn)}- 
Show that V,, is open and dense in (P, d). 

(c) If Baire’s category theorem is true, there exists f € N°2 Vn. 
Ifn € Zt, let j(n) = inf{k: k > n, fe € bn}. Use recursion 
to show there exists an increasing sequence (c,)?29 such that 
co = 0 and f(cn41) € bc, forn € NT. 

(d) Show that the axiom of dependent choice holds. 

Suppose that (X,d) is a separable complete metric space. Prove 

Baire’s category theorem for (X,d) without using the axiom of 

dependent choice. Let S = {s1,52,...} be a countable dense sub- 

set of (X,d), and let r1,7r2,... be an enumeration of the positive 
rational numbers. Show that at each stage there is a least j(n) such 

that if c, = $j(m) then cp € Ne,_,(Cn—-1) NUn 1S, and a least k(n) 


such that if €, = Tx) then 
Neen) Cc M(x) Cc Neo, (Ca—1) Uy. 


In particular, Osgood’s theorem does not need the axiom of 
dependent choice. 
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14.7.6 Suppose that f is a continuous function on T. The n-th Fourier 
coefficient f, of f is defined as 


f eye i —int it 
fax f Pe vat: 


Let 
n 
t) = fet. 
j=—n 
In Volume I, Section 9.5, we constructed an example of a continu- 
ous function on T whose Fourier series is unbounded at 0. In this 
exercise, we show that the set of functions for which this is true is 


of the second category in C(T). 
(a) Show that 


where 


n F 1 
= = gt __ Sin(n + 5)t 
D,,(0) = 2n+1 and Dalit) = ze a ~sint/2_ otherwise. 
(b) IfO <t< 17, let fale ) = sin(n + 4)t, and if —7 < t < 0, let 
fale’ it) = —sin(n + $)t. Let t; = ja/(2n + 1). Show that 


1 ) sin?(n + 4)t 
S 0)=- —_—_—* di 
n( fr) (0) 1 dX [ sint/2 
2n+1 i‘. 2n+1 
1 2 j 1 1 
> rs sin?(n + $)tdt = — =. 
a jal J vti-2 at j=l J 


(c) Let dn(f) = Sn(f)(0). Deduce that (dn)?29 is a sequence of 
continuous linear functionals on C(T) which is unbounded in 
norm. 

(d) Use the principle of condensation of singularities to show 
that the set of functions f in C(T) for which the sequence 
(S,(f)(0))°21 is unbounded is of the second category in C(T). 

14.7.7 Suppose that T is a linear mapping of a normed space (F£, ||.||;;) 
into a normed space (F;_||.||;-). Show that T has a closed graph if 

and only whenever z, > 0 in E and T(z,) > y in F then y = 0. 
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Suppose that T is a linear mapping from a Hilbert space H into 
itself for which (T(x), y) = (x,T(y)) for all x,y € H. Show that T 
is continuous. 

Let w be the vector space of all real sequences, and let @ be the 
linear subspace of all sequences with finitely many non-zero terms. A 
Banach sequence space (E,||.||,;) is a Banach space (£, ||.||~), where 
FE is a linear subspace of w which contains ¢ with the property 
that if (c(™)°2, is a sequence in E for which Ix || +0 as n- 00 
then ai") —0 as n- oo for each j EN. Show that if (£,||.||,,) and 
(F, ||-|| 2) are Banach sequence spaces and E C F then the inclusion 
mapping £ —> F is continuous. 

Suppose that ||.|| is a complete norm on the space Cy(X) of bounded 
continuous real-valued functions on a topological space (X,T) with 
the property that if || f,|| > 0 as n — oo then f,(x) > 0 as n > 00 
for each x € X. Show that the norm ||.|| is equivalent to the uniform 
norm ||.||,,- 

Give an example of a norm ||.|| on C([0,1]) with the property that 
if || fn|| 2 0 as n > co then f,(x) > 0 as n > co for each x € [0,1] 
which is not equivalent to the uniform norm ||.||,.. 


15 


Compactness 


15.1 Compact topological spaces 


Two of the most powerful results that we met when considering functions of 
a real variable were the Bolzano—Weierstrass theorem and the Heine—Borel 
theorem. Both of these involve topological properties, and we now consider 
these properties for topological spaces. We shall see that they give rise to 
three distinct concepts; in Section 15.4, we shall see that these three are the 
same for metric spaces. 

We begin with compactness; this is the most important of the three 
properties. It is related to the Heine—Borel theorem, and the definition is 
essentially the same as for subsets of the real line. If A is a subset of a set 
X and B is a set of subsets of X then we say that Bb covers A, or that B 
is a cover of A, if A C UpegB. A subset C of B is a subcover if it covers 
A. A cover B is finite if the set GB has finitely many members. If (X,7) is 
a topological space, then a cover B is open if each B € B is an open set. 
A topological space (X,7) is compact if every open cover of X has a finite 
subcover. A subset A of a topological space (X,7) is compact if it is com- 
pact, with the subspace topology. If U is a subset of A which is open in the 
subspace topology, there exists an open subset V of X such that U =VNA, 
and so A is a compact subset of X if and only if every cover of A by open 
subsets of X has a finite subcover. 

The Heine—Borel theorem states that a subset of R is compact if and only 
if it is closed and bounded. 

We can formulate the definition of compactness in terms of closed sets: 
this version is quite as useful as the ‘open sets’ version. Recall that a set 
F of subsets of a set X has the finite intersection property if whenever 
{F,,..., Fp} is a finite subset of F then Nj-1F; is non-empty. 
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Theorem 15.1.1 A topological space (X,7T) is compact if and only if when- 
ever F is a set of closed subsets of X with the finite intersection property 
then the total intersection NpeFF is non-empty. 


Proof This is just a matter of taking complements. Suppose that 
NrefF = 0. Then {C(F) : F € F} is an open cover of X, and so there 
is a finite subcover {C(F1),--- ,C(F,)}. Thus 


SSCP UC G,) —=C fi... 1P,), 


so that F,N...NF, =, contradicting the finite intersection property. 


The converse is as easy, and is left to the reader as an exercise. 
We have the following ‘local’ corollary. 


Corollary 15.1.2 Suppose that C is a set of closed subsets of a compact 
topological space (X,T) and that NcecC is contained in an open set U. Then 
there exists a finite subset F of C such that NgexC CU. 


Proof Let Cj} =CU{X \U}. Then C, is a set of closed subsets of X, and 
Acec,C = 0, and so C, fails to have the finite intersection property. There 
exists a finite subset F of C such that (NcerC) A(X \ U) = 0: that is, 
Neer CU. 


Proposition 15.1.3 Suppose that (X,7T) is a topological space and that A 
is a subset of X. 

(i) If (X,7) is compact and A is closed, then A is compact. 

(it) If (X,7) is Hausdorff and A is compact, then A is closed. 

(iti) If (X,7) is compact and Hausdorff then it is normal. 


Proof (i) Suppose that F is a set of closed subsets of A with the finite 
intersection property. Since A is closed, the sets in F are closed in X. Since 
(X,7) is compact, N{C : C € F} is not empty. 

(ii) Suppose that x ¢ A. We shall show that there are disjoint open sets 
U and V with A CU and a © V. For each a € A there exist disjoint open 
subsets U, and V, of X with a € U, and x € Vy. The sets {Uz : a € A} 
form an open cover of A, and so there is a finite subset F' of A such that 
{U, : a € F} is a finite subcover of A. Then U = U{U, : a € F} and 
V =M{V,: a € F} are disjoint open sets, and A CU, 2 € V. Thus z ¢ A, 
so that A = A, and A is closed. 

(iii) Suppose that A and B are disjoint closed subsets of X. We repeat 
the argument used in (ii). For each b € B, there exist disjoint open subsets 
U, and V, of X with A C U, and b € Vy. The sets {Y, : b € B} form an open 
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cover of B, and so there is a finite subset G of B such that {V, : b € G} is 
a finite subcover of B. Then U =N{U,: 6 € G} and V =U{VY,: b € G} are 
disjoint open sets, and ACU, BCYV. 


Compact spaces which are not Hausdorff are less well behaved. For exam- 
ple, if X is an infinite set with the cofinite topology tr then (X,7,/) is 
compact, and so are all of its subsets. Some authors include the Hausdorff 
property in their definition of compactness, and we shall concentrate our 
attention on such spaces. 


Proposition 15.1.4 Suppose that f is a continuous mapping from a topo- 
logical space (X,T) into a topological space (Y,o). If A is a compact subset 
of X then f(A) is a compact subset of Y. 


Proof Suppose that U is an open cover of f(A). If U €U then f—!(U) is 
open, since f is continuous. Thus {f~!(U) : U € U} is an open cover of A. 
Since A is compact, there is a finite subcover {f~'(U1),..., f~'(Un)}. Then 
{U1,...,Un} is a finite subcover of f(A). 


Corollary 15.1.5 Suppose that f is a continuous real-valued function on a 
compact space (X,T). Then f is bounded on A, and attains its bounds: there 
exist y€ X with f(y) =supgex f(x) and z € X with f(z) = infrex f(z). 


Proof For f(X) is a compact subset of R, and so is bounded and closed, 
by Theorem 5.4.4 of Volume I. 


Proposition 15.1.6 Suppose that f is a continuous mapping from a com- 
pact topological space (X,T) onto a Hausdorff topological space (Y,o), and 
that g is a mapping from (Y,c) into a topological space (Z,p). Then g is 
continuous if and only if go f is continuous. 


Proof If g is continuous then certainly g o f is continuous. Conversely, 
suppose that go f is continuous. Suppose that C' is a closed subset of Z. 
Then (go f)~!(C) is closed in X, and is therefore compact, by Proposition 
15.1.3 (i). Thus g~!(C) = f((go f)~1(C)) is compact, by Proposition 15.1.4, 
and is therefore closed, by Proposition 15.1.3 (ii). Thus g is continuous. 


Corollary 15.1.7 If f is a continuous bijection from a compact topo- 
logical space (X,T) onto a Hausdorff topological space (Y,o), then f is a 
homeomorphism. 


Proof Take g = f7!. 


The topology of a compact Hausdorff space has a certain minimal 
property. 
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Corollary 15.1.8 Suppose that (X,r) is a compact Hausdorff space, and 
that o is a Hausdorff topology on X which is coarser than tT. Then o =T. 


Proof Apply the corollary to the continuous identity mapping from (X,7) 
to (X,¢). 


Theorem 15.1.9 The product of finitely many compact spaces is compact. 


Proof A standard induction argument shows that it is enough to prove 
that the product of two compact spaces (X,7) and (Y,c) is compact. Sup- 
pose that U/ is an open cover of X x Y. If P = (x,y) € X x Y, there exists 
Up €U with P € Up. Since Up is open, there exist open neighbourhoods 
Vp of x and Wp of y such that Vp x Wp C Up. It is then clearly sufficient 
to show that finitely many of the sets Vp x Wp cover X x Y. 

Suppose that « € X. The cross-section C, = {(xz,y) : y € Y} is home- 
omorphic to Y, and is therefore compact. It is covered by the collection 
{Viay) X Way) : y € Y} of open sets, and is therefore covered by a finite 
subset {Viey,) X Wiey,) 2 1 <7 < nj. Let Qe = Nf_yViey,). Then Qz is 
an open neighbourhood of x, and Q; x Y C UF_1Vx,y5) x Wia,y,)- The sets 
{Qz : x € X} cover X. Since (X,7) is compact, there is a finite subcover 
{Qz,,-++,Qz,,}. Then the sets {Qz, x Y,...,Qz,, x Y} cover X x Y. Since 
each of then is covered by finitely many sets Vp x Wp, X x Y is covered by 


finitely many sets Vp x Wp. 


Corollary 15.1.10 A subset A of R% or C% is compact if and only if it 
is closed and bounded. 


The proof of Theorem 15.1.9 is rather awkward, and only deals with the 
product of finitely many spaces. In fact, a careful use of the axiom of choice 
can be used to prove the following. 


Theorem 15.1.11 (Tychonoff’s theorem) Jf (Xa,Ta)aca is a family of 
compact topological spaces, then || <4 Xa is compact in the product topology. 


In particular, P(X), with the Bernoulli topology, is compact. 
To prove this, ‘sequences’ are replaced by ‘filters’. This involves introduc- 
ing a fair amount of machinery. A proof is given in Appendix D. 


Exercises 


15.1.1 Show that the union of finitely many compact subsets of a topological 
space is compact. 

15.1.2 Show that the intersection of a collection of compact subsets of a 
Hausdorff topological space is compact. 
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15.1.3 Give an example of two compact subsets of a 7; topological space 
whose intersection is not compact. 

15.1.4 Suppose that (X1,7,) and (X2,72) are Hausdorff topological spaces 
and that (X2,72) is compact. Show that if A is a closed subset of 
X 1 x Xq then 71(A) is closed in (Xj,71). 

15.1.5 Suppose that f is a mapping from a topological space (X1,71) into a 
compact topological space (X2, 72) whose graph G'y is a closed subset 
of Xx X2. Show that f is continuous. Can the condition that (X2, dz) 
is compact be dropped? 

15.1.6 Suppose that G is a closed subgroup of (R%,+) which does not con- 
tain a line (if z € R@\ {0}, then l, = {ax : a € R} is not contained 
in G). Suppose that x € S4! = {x € R¢: ||z|| = 1}. By consid- 
ering l, 7 G, show that there exist r > 0 and € > 0 such that if 
y € N(x) S41 and 0 <a <r, then ay ¢ G. Use the compactness 
of S¢-1 to show that G is a discrete subset of R”. 

15.1.7 Let B,([0,1]) be the collection of subsets of [0,1] with at most n 
elements. Show (without appealing to Tychonoff’s theorem) that 
B,,([0, 1]) is a compact subset of P((0, 1]), with the Bernoulli topology. 
(Hint: induction on n.) 


15.2 Sequentially compact topological spaces 


We now make a definition inspired by the Bolzano—Weierstrass theorem. 
We say that a topological space (X,7T) is sequentially compact if whenever 
(%n)P21 is a sequence in X then there exists a subsequence (xp, )?2., and an 
element x € X such that z,, — x as k — oo. We say that a subset A of 
X is sequentially compact, if it is sequentially compact, with the subspace 
topology: if (an)°2, is a sequence in A then there exists a subsequence 
(Gn, p21 and an element a € A such that an, 4 a as k — oo. 

Thus the Bolzano—Weierstrass theorem implies that a subset of R is 
sequentially compact if and only if it is closed and bounded; that is, if and 
only if it is compact. 


Proposition 15.2.1 Suppose that (X,rT) is a sequentially compact topo- 
logical space. 

(i) A closed subset A of X is sequentially compact. 

(ii) If f is a continuous mapping of (X,7) into a topological space (Y,c) 
then f(X) is sequentially compact. 
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Proof (i) Suppose that (a,)°2, is a sequence in A. There is a subsequence 
(an, )P21 and an element x € X such that an, — x as k > oo. Since A is closed, 
LEA. 

(ii) Suppose that (y,)°2, is a sequence in f(X). For each n € N there 
exists , € X such that f(%,) = yn. Then there exist a subsequence (ap, )7°4 
and an element x € X such that tp, > x as k > oo. Then yn, = f(tn,) 
f(z) as k 4 ow. 


Theorem 15.2.2 Suppose that (X,7) is a finite product []')_4(Xj,7j) or 
a countably infinite product Wj, Tj) of sequentially compact topological 
spaces. Then (X,T) is sequentially compact. 


Proof We consider the countably infinite case: the finite case is easier. We 


use a diagonal argument, as in the proof of the Bolzano—Weierstrass theo- 


oc_, is a sequence in X. There exists a subsequence 
(y“*))e° , and an element y, of X) such that yl) — y, as k — oo. Induc- 


rem. Suppose that (a) 


tively, for each 7 € N we can find a subsequence (y¥"))% , of (y9-1*))% | 
and an element y; of such that yo) — yj as k—+oo. Then (Gree, is a 


subsequence of (Gee and ye) — yj as k — oo, for each j € N. Thus if 


we set y = (yj)72, then y(kk) 5 y in (X,7) as k > oo. 


Corollary 15.2.3 The Hilbert cube is sequentially compact. 
Let us give two examples, related to these results. 


Example 15.2.4 An uncountable product of sequentially compact topo- 
logical spaces which is compact, but not sequentially compact. 


Let S be the Bernoulli sequence space Q(N); S is the set of all 
sequences taking the values 0 and 1, and is an uncountable set. Let 
X = Q(S), with the product topology. Then (X,7) is compact, by 
Tychonoff’s theorem. We show that (X,7) is not sequentially compact. For 
né€ Nand s€S let x{” = 5,. Then (a'))°°_, is a sequence in (X,7). We 
shall show that it has no convergent subsequences. Suppose that (ar(™))e0 | 
is a subsequence. Define an element s of S by setting s,, = 1 if k is even, 
and setting s, = 0 otherwise. Then a") takes each of the values 0 and 1 


infinitely often, and so does not converge. 


Example 15.2.5 A sequentially compact subset C’ of a Hausdorff topo- 
logical space (X,7) which is not closed, and is therefore not compact. 
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Take (X,7) the space of the preceding example; it is a Hausdorff space. 
Let 


C={xeX:{s ES: x, =1} is countable}. 


C is a dense proper subset of X, and so is not closed. Suppose that Gin yen, 


is a sequence in C. For n EN, let S,={sES: af = 1} and let S.o = UP? 
Sp. Then Soo is countable, and if s € S \ S. then a”) = 0 foralneN. 
A diagonal argument just like that of Theorem 15.2.2 shows that there is a 
subsequence (a"*))®_, such that a") converges, to 1,, say, as k — oo, for 
each s € Soo. Thus if we set 1, = 0 fors € S\ So then! € C and rp, > | 
as k + oo. 

We now introduce a topological property that is rather weaker than 
sequential compactness. We need a definition. If (x) is a sequence in a 
topological space (X,7), and « € X, then z is a limit point of the sequence 
if whenever N is a neighbourhood of x and n € N, there exists m > n 
such that z,, € N. A topological space (X,7) is countably compact if every 
sequence in (X,7) has a limit point. 


Proposition 15.2.6 A sequentially compact topological space (X,T) is 
countably compact. 


Proof Suppose that («”))%° 


n=1 
a subsequence (a("*))% | which converges to x. Then z is a limit point of 


the sequence (a) )°°,. 


is a sequence in X. There exists x € X and 


Proposition 15.2.7 Suppose that (X,rT) is a countably compact topologi- 
cal space. 

(i) A closed subset A of X is countably compact. 

(ii) If f is a continuous mapping of (X,7) into a topological space (Y,c) 
then f(X) is countably compact. 


Proof The proof is very similar to the proof of Proposition 15.2.1, and the 


details are left to the reader. 


Proposition 15.2.8 A first countable topological space (X,T) is sequen- 
tially compact if and only if it is countably compact. 


Proof It is enough to show that if (X,7) is countably compact, then it 
is sequentially compact. Suppose that (x,,)°2y 
limit point J, and / has a decreasing base of neighbourhoods (B;,)?_,. There 
then exists a subsequence (vp, )72, such that x, € By, for k €¢ N. Then 


Ln, — las k + co, so that (X,7) is sequentially compact. 


is a sequence in X. It has a 
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What is the relationship between compactness and countable compact- 
ness? 


Proposition 15.2.9 <A topological space (X,T) is countably compact if 
and only if every cover of X by a sequence (O,)°°, of open sets has a finite 
subcover. 


Proof Suppose that (O,,)?2, is an open cover of X, and that there is no 
finite subcover. Then for each n there exists x, € X \ (U%_,O;). We show 
that the sequence (z%,,)°2, has no limit point in X, so that (X,7) is not 
countably compact. If « € X, then x € O, for some n € N, so that O, is an 
open neighbourhood of x. Since 2; ¢ On for 7 > n, it follows that x is not 
a limit point of the sequence (x,,)°°.,. Thus the condition is necessary. 
Conversely, suppose that every cover of X by a sequence (O,,)°°, of open 
sets has a finite subcover. Taking complements, this implies that if (F;,)?°, is 
a sequence of closed sets with the finite intersection property then N72, Fr, ¢ 
). Suppose that (2,)°2 4 
F,, = Ty. Then (F;,)°2 1 is a decreasing sequence of non-empty closed sets, 
and so has the finite intersection property. Thus there exists x € N°2., Fp. If 
né€Nand NEWN,, then NNT, 4 0, so that there exists m > n such that 


lo) 
n=l" 


is a sequence in X. Let T,, = {xj : j > n}, and let 


Im € N: x is a limit point of the sequence (x) 


Corollary 15.2.10 Jf (U,)°2, is an increasing sequence of open subsets 
of a countably compact topological space (X,T) whose union is X, then there 
exists no © N such that Uy, = X. 


Corollary 15.2.11 A compact topological space is countably compact. 


In fact, countable compactness is sufficient for many problems concerning 
sequences of functions, as the next result shows. Recall that a sequence of 
continuous real-valued functions on a closed interval which converges point- 
wise to a continuous function need not converge uniformly. Things improve 
if the convergence is monotone. 


Theorem 15.2.12 (Dini’s theorem) Suppose that (fn)? is an increasing 
sequence of continuous real-valued functions on a countably compact topo- 
logical space (X,T) which converges pointwise to a continuous function f. 
Then fn — f uniformly as n —- oo. 


Proof Suppose that « > 0. Let U, = {f € X : fn(x) > f(a) — é}. 
Since the function f — f, is continuous, U, is open. (U;,)°2, is an increas- 


ing sequence, and UnenUn = X, since f,(xz) > f(x) for each x € X; 
{Un : n € N} is an open cover of X. Since (U,,)°°, is an increasing sequence, 
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there exists no € N such that U,, = X (Corollary 15.2.10). If m > ng and 
xe X then 


O< f(x) _ fn(2) = f(x) _ Sno (2) <€, 


so that ||f — frill, <6 fn 7 f uniformly as n — oo. 


15.3 Totally bounded metric spaces 


We now consider what happens when we restrict attention to metric spaces. 
We need to introduce one further idea. If (X,d) is an unbounded metric 
space then the function d/(x,y) = min(d(x,y),1) is a metric on X which 
is uniformly equivalent to d, and X is bounded under this metric. Thus 
boundedness is not a uniform property. We introduce a stronger boundedness 
property that is preserved under uniform homeomorphisms. Suppose that 
€ > 0. A subset F’ of a metric space (X,d) is an e-net if UperNe(x) = X; 
every point of X is within ¢ of a point of F’. (X,d) is totally bounded, or 
precompact, (the two names are used equally frequently, but we shall prefer 
the former) if, for every « > 0, there exists a finite e-net; X can be covered 
by finitely many open neighbourhoods of radius ¢. In other terms, (X,d) is 
totally bounded if and only if, for every « > 0, X is the union of finitely 
many subsets of diameter at most «. A subset A of X is totally bounded if it 
is totally bounded with the subset metric. This concept is not a topological 
one. For example, the subset (—7/2,7/2) is clearly totally bounded under 
the usual metric, but is not totally bounded under the metric p(z,y) = 
|tanz — tany|, since tan defines an isometry of ((—7/2,7/2),p) onto R, 
with its usual metric, and the latter is certainly not totally bounded. 


Proposition 15.3.1 A totally bounded subset A of a metric space (X,d) 
is bounded. 


Proof Take « = 1. There exists a finite subset F' of A such that A C 
UrerNi(x). If y1, yo € A then there exist 21,22 € F' such that y; € Ni(21) 
and y2 € No(x2). By the triangle inequality, 


d(y1,y2) < d(yi, 21) + d(x1, £2) + d(x2, yo) < diam (F) + 2, 


so that A is bounded. 


Boundedness is not a uniform property, but total boundedness is. 


Proposition 15.3.2 Suppose that A is a totally bounded subset of a metric 
space (X,d) and that f is a uniformly continuous mapping of (X,d) into a 
metric space (Y,p). Then f(A) is a totally bounded subset of (Y,p). 
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Proof Suppose that « > 0. Then there exists 6 > 0 such that if d(x, y) <6 
then p(f(x), f(y)) < e. Let F be a finite d-net in A. Then f(F’) is a finite 
e-net in f(A). 


Corollary 15.3.3 If dandd' are uniformly equivalent metrics on X, then 
(X,d) is totally bounded if and only if (X,d’) is. 


Proposition 15.3.4 A totally bounded metric space (X,d) is second 
countable, and is therefore separable. 


Proof For each n € N there exists a finite 1/n-net F, in X. Let 
Un = {Nijn(x) : 2 © Fy} and let U = UnenUy. Then YU is a count- 
able collection of open subsets of X. Let us show that it is a base for 
the topology. Suppose that O is an open subset of X and that x € X. 
There exists 6 > 0 such that N(x) C O. Choose n so that 1/n < 6. 
Then there exists Ur = Ny/an(y) € Un such that x € U,. If z € Uz then 
d(z,x) < d(z,y) + d(y,x) < 1/n so that z € O. Thus xz € U, C O, so that 
O=U{U €uU:U CO}, and U is a base for the topology. 


Proposition 15.3.5 Suppose that S is a dense totally bounded metric 
subspace of a metric space (X,d). Then (X,d) is totally bounded. 


Proof Suppose that ¢ > 0. There exists a finite subset F' of S' such that 
S=U{N.20S5: f € F}. Ifa € X there exists s € S with d(z,s) < €/2 and 
there exists f € F' with d(s,f) < €/2. Thus d(z,f) < ¢, and F is a finite 
e-net in X. 


Corollary 15.3.6 A metric space (X,d) is totally bounded if and only if 
its completion is totally bounded. 


Total boundedness can be characterized in terms of Cauchy sequences. 


Theorem 15.3.7 A metric space (X,d) is totally bounded if and only if 
every sequence in X has a Cauchy subsequence. 


Proof Suppose first that (X,d) is totally bounded, and that (#,)°°, is 
a sequence in X. We use a diagonal argument to obtain a Cauchy subse- 
quence. There exists a finite cover {A1,..., Ax} of X by sets of diameter at 
most 1. By the pigeonhole principle, there exists j such that x, € Aj; for 


oo 
n=1 


such that d(y1m, Yin) < 1 for m,n € N. Repeating the argument, there 


exists a subsequence (Yy2n)?1 of (Yin)? such that d(yom,yon) < 1/2 
for m,n € N, and, iterating the argument, for each j there exists a 


infinitely many n. That is, there exists a subsequence (y1,)°2 1 of (%n) 
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subsequence (yj;+1,n)e—1 Of (yj.n)721 such that d(yj41.m,yj4in) < 1/9 +1) 
for m,neEN. The sequence (55) Goa is then a Cauchy subsequence of 
(@n)r=1- 

Suppose next that (X,d) is not totally bounded; there exists « > 0 such 
that there is no finite e-net in X. Choose 7; € X. Then N,(x 1) 4 X, and so 
there exists 72 € X with d(#1, x2) > e. Iterating this argument, there exists 
a sequence (%p,)72., such that for each n € N, @p4i ¢ UF_)Ne(x;). Thus 
ifm # n then d(t%m,2n) > €, and so the sequence (z,,)72., has no Cauchy 
subsequence. 


Exercise 


15.3.1 Show that a subset A of a metric space (X,d) is totally bounded if 
and only if whenever ¢€ > 0 there exists a finite subset G of X such 
that A C Uzeg Ne (2). 


15.4 Compact metric spaces 


Things work extremely well for metric spaces. 


Theorem 15.4.1 Suppose that (X,d) is a metric space. The following are 
equivalent: 

(i) (X,d) is compact; 

(ii) (X,d) is sequentially compact; 

(itt) (X,d) is countably compact; 

(iv) (X,d) is complete and totally bounded. 


Proof We have seen that (i) implies (iii) (Corollary 15.2.11), and that (ii) 
and (iii) are equivalent (Proposition 15.2.8). 

Let us show that (ii) and (iv) are equivalent. Suppose first that (X, d) 
is sequentially compact, and suppose that (x,,)°2, is a Cauchy sequence 
in X. Then (z,,)°°, has a convergent subsequence, and so by Proposition 
14.1.1 (ap)?@, is convergent. Thus (X,d) is complete. Since every sequence 
has a convergent subsequence, which is a Cauchy subsequence, (X,d) is 
totally bounded, by Theorem 15.3.7. Conversely, suppose that (X,d) is 


lore) 
n=1 


(X,d) is totally bounded, there is a Cauchy subsequence (ap, )?2,, and this 


complete and totally bounded, and that (z,,) is a sequence in X. Since 


subsequence converges, since (X,d) is complete. 
Finally let us show that (ii) and (iv) imply that (X,d) is compact. We 
need a lemma, of interest in its own right. 
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Lemma 15.4.2 If O is an open cover of a countably compact metric space 
(X,d), there exists 6 > 0 such that for each x € X there exists O € O for 
which Ns(x) CO. 


Proof Suppose not. Then for each n € N there exists x, € X for which 
Nj /n(@n) is not contained in any O € O. Let x be a limit point of the 
sequence (x,,)°°.,. Then x € O, for some O € O. Since O is open, there exists 
€ > 0 such that N.(x) C O. Since z is a limit point of the sequence, there 
exists n > 2/e such that zp € N.(x). Ify © Ny jn(@n), then d(y, In) < €/2, so 
that d(y,x) < d(y,%n) + d(an, x) <e. Hence Ny /_(2%n) C Ne(x) C O, giving 
a contradiction. 


A number 6 which satisfies the conclusion of the lemma is called a Lebesgue 
number of the cover. 

Suppose now that © is an open cover of (X,d). Let 6 > 0 be a Lebesgue 
number of the cover. Since (X,d) is totally bounded, there exists a finite 
6-net F' in X. For each x € F there exists O, € O such that N5(x) C O;. 
Then X = UzerN5(x) = UrerOx, so that {O, : x € F} is a finite subcover 
of X. 


Note that neither of the conditions of (iv) is a topological condition, but 
that together they are equivalent to topological conditions. 
Let us bring some earlier results together. 


Corollary 15.4.3. A compact metric space is second countable, and is 
therefore separable. 


Proof Proposition 15.3.4. 


Corollary 15.4.4 The completion of a totally bounded metric space is 
compact. 


Proof Proposition 15.3.5. 


This explains the terminology ‘precompact’. 


Corollary 15.4.5 A finite or countably infinite product of compact metric 
spaces, with a product metric, is a compact metric space. 


Proof Theorem 15.2.2. 


Corollary 15.4.6 The Hilbert cube and the Bernoulli sequence space Q(N) 
are compact. 


Proof Corollary 15.4.5. 
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We can characterize compactness in terms of the Hilbert cube, and in 
terms of the Bernoulli sequence space. 


Corollary 15.4.7. A metric space (X,d) is compact if and only if it is 
homeomorphic to a closed subspace of the Hilbert cube. 


Proof If (X,d) is compact, it is second countable (Corollary 15.4.3), and 
therefore there is a homeomorphism f of X onto a metric subspace f(X) of 
the Hilbert cube, by Urysohn’s metrization theorem (Theorem 13.5.6). But 
f(X) is compact, and so it is a closed subset of the Hilbert cube. Conversely, 
if (X,d) is homeomorphic to a closed subspace of the Hilbert cube, it must 


be compact. 


Theorem 15.4.8 A metric space is compact if and only if there is a 
continuous surjective mapping of the Bernoulli sequence space Q(N) onto X. 


Proof Since Q(N) is compact, the condition is sufficient. 

Suppose that (X,d) is compact. We give Q(N) the product metric 
P(Y, 2) = 524 lus — @yl/3?. If y € QUIN) and j E N,, let Cj(y) = M/s: (y). 
Then 


O;(y) ={z €Q(N): 3 = y; for 1 <i< j}. 


Such a set is called a j-cylinder set. Let C; be the set of j-cylinder sets: 
ICj| = 2’. 

We now show that there is a strictly increasing sequence (s;)7°, in N, 
and a sequence (fx : Cs, + X)?2, of mappings, such that 

(i) fr(Cs,) is a 1/2*-net in (X,d), and 

(ii) if y € Q(N) then d(fk(Cs,(Y)), farr1(Csny:(y))) < 1/2". 

We first define s; and f;. There exists a finite 1/2-net F) in (X,d). Choose 
$1 so that 2°: > |F)|. Since |C,,| = 2°', there is a surjective mapping of C,, 
onto F}. 

Suppose now that s1,...,s, and fy,...f; have been defined. For each 
C €C,, there is a 1/2**!-net Fi41(C) in Ny/ox(fr(C)). Choose ng 41 > rk 
so that 


arert > max{|Fe4i(C)|: C € Cz, }- 


Let $441 = 8% + Mg41, and let Cy41(C) be the set of s,41-cylinder sets con- 
tained in C: there are 2"*+' of them. There is therefore a surjective mapping 
fr+i.c from Cy41(C) onto Fi,41(C). Letting C vary, and combining the map- 
pings fz41,c, we obtain a mapping of C into (X,d) which satisfies (i) 
and (ii). 


Sk41 
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If y € Q(N) and k € N, let ge(y) = fr(Cs,(y)). Then each gy, is a 
continuous mapping of Q(N) into (X,d), and d(gx(y), ge4i(y)) < 1/2". Thus 
if k<l then d(gx(y),m(y)) < 2/2". It therefore follows from the general 
principle of uniform convergence that the mappings g, converge uniformly 
to a continuous function g mapping 2(N) to (X,d), and that d(g(y), gx(y)) < 
2/2" for y€ Q(N) and k EN. 

It remains to show that g is surjective. Since Q(N) is compact, g(Q(N)) is 
compact, and is therefore closed in X. It is therefore sufficient to show that 
g(Q(N)) is dense in X. Suppose that x € X and that « > 0. There exists 
k such that 1/2* < €/3, and there exists y € Q(N) such that d(x, 9z(y)) < 


1/2*. Thus d(x, g(y)) < d(x, gx(y)) + d(gn(y), g(y)) < 3/2* <e. 


Corollary 15.4.9 <A metric space is compact if and only if there is a 
continuous surjective mapping of the Cantor set C' onto X. 


Proof For the Cantor set is homeomorphic to 2(N). 


The next result is particularly important. 


Theorem 15.4.10 Jf f is a continuous map from a compact metric space 
(X,d) into a metric space (Y,p) then f is uniformly continuous. 


Proof Suppose not. Then there exists € > 0 for which we can find no suit- 
able 6 > 0. Thus for each n € N there exist x, 2/, in X with d(ap,x/,) <1/n 
and p(f(tn), f(x},)) => «. By sequential compactness there exists a sub- 
sequence (Xp, )72, which converges to an element x¢X as k— oo. Since 
d(Xn,)Lp,) +0 as k—->00, a), >a, as well. Since f is continuous at z, 
f(tn,) > f(a) and f(a},,) > f(x) as k > 00, so that p(f(an,), f(xh,)) 4 0 as 
k — oo. As p(f(an,), f(2',,)) = € for all k € N, we have a contradiction. 


Nk 


Isometries of compact metric spaces behave well. 


Theorem 15.4.11 Suppose that f is an isometry of a compact metric 
space into itself. Then f is surjective. 


Proof Suppose not. Then f(X) is a proper closed subset of X. Suppose 
that x € X\ f(X), and let 6 = d(z, f(X)). Ifn € N then f"(x) € f(X), and 
so d(f"(x), x) > 6. Since d is an isometry, if k € N then d(f"**(z), f*(x)) = 
d(f*(f"(x)), f*(a)) => 6, so that the sequence (f"(x))°, has no convergent 
subsequence. 


15.4.1 


15.4.2 


15.4.3 


15.4.4 


15.4.5 


15.4.6 
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Exercises 


Suppose that A and B are disjoint subsets of a metric space (X, d), 
and that A is compact and B is closed. Show that 


d(A, B) = inf{d(a,b): ae A,be B} >0. 
Suppose that (X,d) is a compact metric space. Let 
C= {x e€X: az has a countable neighbourhood} 


Show that C' is an open subset of X and that P = X \ C is perfect. 
By considering sets P, = {x € X : d(x,P) < 1/n}, or otherwise, 
show that C' is countable: X is the union of a countable set and a 
perfect set. 

Suppose that (X,d) is a compact metric space, and that (fn)°2o9 is 
a sequence of continuous functions from (X,d) into a metric space 
(Y,p), with the property that d(f,(x), fo(z)) is a decreasing null 
sequence, for each x € X. Show that f, converges uniformly to fo as 
n — oo. (This generalizes Dini’s theorem.) 

Give an example of a sequence (U;,)°2, of open subsets of R whose 
union contains the rationals and whose complement is infinite. 
Show that an open cover of a separable metric space has a countable 
subcover. (The previous exercise shows that some care is needed.) 
Suppose that (X,d) is a compact metric space, and that f is a map- 
ping of X into itself which satisfies d(f(x), f(y)) > d(x,y) for all 
x,y € X. By considering the sequence ((f"(x), f”(y)))p24, in X x X, 
show that if z,y € X and € > 0 then there exists n € N such that 
d(x, f"(x)) < € and d(y, f"(y)) < € . Show that f is an isometry, and 
give another proof that f is surjective. 


15.5 Compact subsets of C(K) 


Suppose that (K,d) is a compact topological space. Let C(A’) denote the 
(real or complex) vector space of continuous (real or complex) functions on 
K. If f € C(K) then f() is compact, and is therefore closed and bounded. 
Thus C(K) = C,(i), and C(K) is a Banach space under the uniform norm 
lf ll. = sup{|f(ax)| : 2 € kK}. What are the compact subsets of C(v)? Since 
C(k) is complete, a subset of C(/‘) is compact if and only if it is closed 
and totally bounded, and so it is enough to characterize the totally bounded 
subsets of C(x). 
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In order to do this, we need another definition. Suppose that (X,7T) is a 
topological space, that (Y,d) is a metric space, and that « € X. A set A of 
mappings from (X,7T) to (Y,d) is equicontinuous at x if, given € > 0, there 
exists N € N; such that d(a(x),a(y)) < € for alla € A and y € N. The set 
A is equicontinuous on X if it is equicontinuous at each point of X. 


Theorem 15.5.1 (The Arzela—Ascoli theorem) Suppose that (K,r) is 
a compact topological space, that (C(K),||.||,,) is the (real or complex) 
Banach space of continuous (real or complex) functions on K and that 
AC C(K). Then A is totally bounded if and only if A is bounded in norm 
and equicontinuous on K. 


Proof Suppose that A is totally bounded. Then A is bounded, by Propo- 
sition 15.3.1. Let us show that A is equicontinuous. Suppose that « € K and 
that « > 0. There exists a finite «/3-net F in A. For each f € F there exists 
Ny € Nz such that if y € Ny then |f(x) — f(y)| < €/3. Let N = NyerNy: 
N is a neighbourhood of x. Now if a € A there exists f € F' such that 
la — fll, < €/3. Thus if y € N then 


|a(x) — a(y)| < |a(x) — f(x)| + [f() — Fy) + 1F) — a) 
<¢€/3+6€/3+ 6/3 =€. 


Since this holds for all a € A, A is equicontinuous at 2. 

Conversely, suppose that A is bounded and equicontinuous. Suppose 
that «>0. If eK, there exists N(x) in N; such that if ye N(«#) then 
|a(x) — a(y)|<e/4 for all a € A. Since (K,7T) is compact, there exists a 
finite subset Y={yi,...yn} of K such that K = U"_,N(ym). We use 
Y to define a linear mapping of C(k) into R” (or C”): if f eC(K), we 
set T(f)=(f(y1),---f(yn)). We give R” (or C”) the supremum norm; 
artis ay eg ) Il ne = nae, [ty | Ls wp. Then 


ITP ylloo = sup [f(tm)| < IIflleo: 
1<m<n 


so that T(A) is bounded in R” (or C”). It is therefore totally bounded, by 
Theorem 15.1.10, and so there exists a finite subset F' of A such that T(F)) is 
an €/4-net in T(A). We shall show that F' is an e-net in A, so that A is totally 
bounded. If a€ A there exists f € F such that ||(£(a) — T(f)||,, <¢/4; that 
is, |a(Ym) — f(Ym)| <€/4 for 1 <m<n. Ifa eK there exists ym such that 
x € N(Ym); then |a(x) — a(ym)| < €/4 and |f(x) — f(ym)|<e¢/4. Putting 
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these inequalities together, 
la() ~ F(2)| < lal) ~ a(m)| + la(Gm) ~ Fm) + Lf Um) ~ F2) 
<€/4+6€/4+€/4 = 3€/4. 


Since this holds for all z € K, |la— f||,, < 3¢/4 < €, anda € N,(f). Thus 
F is an e-net in A. 


Corollary 15.5.2 A is compact if and only if it is closed, bounded and 
equicontinuous on K. 


It is possible to characterize the compact subsets of CA‘) locally. 

Suppose that (X,7) is a topological space and that Y is a subset of 
X. The restriction mapping my from C,(X) to Cy(Y) is defined by set- 
ting ty(f)(y) = f(y), for y € Y. my is a norm-decreasing linear mapping 
from (Cp(X), |]-||,,) to (Co(Y), |]-||,), and so is continuous. 


Theorem 15.5.3 Suppose that Mj,...,My are closed subsets of a com- 
pact topological space (K,7) and that kK = Ul_1M;. Then a subset A of 
(C(x), |||.) 18 compact if and only if my,(A) is compact in (C(M;), |l-||,) 
forl<j<n. 


Proof If A is compact, then 7y,(A) is compact, for 1 < 7 <n, since the 
mappings 7), are continuous. 
Conversely, suppose the condition is satisfied. Give the product space 
[Tj-1 C(M;) the norm 
(fis. +) Frdlloo = max All. - 


1<j<n 
If f € C(K), let w(f) = (nar, (f),---s 7a, (f)). Then 


(Flos = max, Ilr, (F) I, = eae If(@)) = IIflle- 
Thus a is an isometric linear mapping of (C(Kk),|].||,,) into 
Tja1(C UG), Il-I|,): In particular, 7(C(i)) is closed in Ia C(M;). 
It follows from Corollary 15.4.5 that ITj-1 m™u,(A) is compact in 
TTj=1(C(Y%)), |I- Io). Since 7(A) = (Ti mu, (A)) 7(C(4)), it follows that 
m(A) is compact. Since 7 is an isometry, A is compact. 


Exercises 


15.5.1 Suppose that (f,)°2, is an equicontinuous sequence of mappings 
from a topological space (X,7) into a metric space (Y,c), and that 
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15.5.2 


15.5.3 


15.5.4 


15.5.5 


15.5.6 


15.5.7 
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fn(x) converges pointwise to f(x) for each x in X. Show that f is 
continuous. 

Suppose that (f;,)°2, is an equicontinuous sequence of mappings from 
a topological space (X,7) into a complete metric space (Y,o), and 
that f,(d) converges to f(d) for each d in a dense subset D of X. 
Show that f,,(a) converges in Y for each x € X. 

Suppose that (K,7) is a compact topological space and that (fn)?24 
is an equicontinuous sequence in C(/‘), which converges pointwise to 
a function f. Show that f,, converges uniformly to f. 

Suppose that (f,)°2, is an increasing sequence of continuous real- 
valued functions on a compact topological space (X,7) which con- 
verges pointwise to a continuous function f. Show that the sequence 
(fn) is totally bounded in C(K). Use this to give another proof 
of Dini’s theorem (for functions on a compact topological space). 
Suppose that (X,d) and (Y, p) are metric spaces. A set A of mappings 
from (X,d) to (Y,p) is uniformly equicontinuous if, given € > 0, there 
exists 6 > 0 such that if d(a1,x72) < 6 then p(a(x1),a(x2)) < €, for 
all a € A. Thus each a € A is uniformly continuous, and we can 
control all the elements of A simultaneously. Suppose that (X,d) is 
compact. Show that an equicontinuous set of mappings A from X to 
Y is uniformly equicontinuous. 

Suppose that A C C[0,1] is equicontinuous and that {a(x) : a € A} 
is bounded, for some x € [0,1]. Show that A is bounded in norm, and 
is therefore totally bounded. 

Prove the following vector-valued version of the Arzela—Ascoli theo- 
rem. Suppose that (/,7) is a compact topological space, that (F, |].||) 
is a normed space and that A is a subset of C(K; £), ||.||,,), the (real 
or complex) normed space of continuous functions on K taking values 
in &. Show that A is totally bounded if and only if A is bounded, 
{a(x) : a € A} is totally bounded in F for each x € K, and A is 
equicontinuous on K. 


15.6 *The Hausdorff metric* 


(This section can be omitted on a first reading.) 


We now give an example of an interesting class of metric spaces. Suppose 


that (X,d) is a metric space. If A is a non-empty subset of (X,d), the open 
e-neighbourhood N,(A) is defined to be 


N,.(A) = {a € X : d(x, A) < €} = Urea N-(2); 
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since N.(A) is the union of open sets, it is open. Since d(x, A) = d(x, A), 


it follows that N.(A) = N,(A). For this reason, we concentrate on the non- 
empty closed subsets of X. Let Con(X) be the set of non-empty bounded 
closed subsets of (X,d); Con(X) is the configuration space of X. 


Theorem 15.6.1 Suppose that Con(X) is the configuration space of a 
metric space (X,d). If A,B € Con(X)and x € X, let 


dy(A,B) = sup |d(x, A) — d(x, B)]. 
rex 


(i) Let e(A, B) = supge, d(a, B). Then 
dy(A, B) = max(e(A, B),e(B, A)) < co. 


(ii) dy is a metric on Con(X). 
(iti) dy (A, B) = inf{e >0:ACN,.(B) and BC N,(A)}. 


Proof (i) First, e(A,B) = sup,e, d(a, B) — supge, d(a, A) < dy(A, B), 
and similarly, e(B, A) < dy(A, B), so that 


max(e(A, B),e(B,A)) < dy(A, B). 


Conversely, suppose that x € X and that ¢ > 0. There exists 6 € B such 
that d(x,b) < d(z,B) + «/2 and there exists a € A such that d(b,a) < 
d(b, A) + €/2. Then 


d(x, A) < d(z,a) < d(z,b) + d(b,a) < d(x, B) + d(b, A) +e 
< d(x, B) + e(B, A) +€. 
Since € > 0 is arbitrary, 
d(x, A) — d(z, B) < e(B, A) < diam (AU B) < 0, 
and similarly d(x, B) — d(x, A) < e(A, B) < co. Thus 
dy(A, B) < max(e(A, B),e(A, B)). 


(ii) Clearly dy(A, B) = dy(B, A) and dy(A, A) = 0. Suppose that A # B; 
without loss of generality we can suppose that A\ B 4. Ifa €¢ A\ B then 


dy(A, B) > d(a, B) — d(a, A) = d(a, B) > 0. 
If ¢ € X, and A, B,C € Con(X) then 
|\d(x, A) — d(x, C)| < |d(x, A) — d(x, B)| + |d(a, B) — d(x,C)| 
< dy(A, B) + dy (B,C); 
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thus dy(A,C) < dy(A, B) + dy(B,C), and so dy is a metric on Con(X). 
(iii) This follows immediately from (i). 


The metric dy is the Hausdorff metric on Con(X); it measures how far 
apart A and B are. 

Let Con,,(X) denote the set of subsets of X with n elements; Con,(X) 
is the n-point configuration space of X. Let Conp(X) = U%,Con,(X); 
Conpr(X) is the finite configuration space of X. 


Proposition 15.6.2 A closed bounded subset A of (X,d) is totally bounded 
if and only if it is in the closure of Conp(X). 


Proof If A is totally bounded and ¢ > 0 there exists a finite subset F’ such 
that A C UrerN-(x). Thus e(A, F’) < ¢. But we can clearly suppose, by 
discarding terms if necessary, that N-(2) A # Q, for each x € F,, and then 
e(F, A) < €; consequently dy (A, F’) < ¢, and A is in the closure of Conr(X). 

Conversely, suppose that A is in the closure of Conp(X), and that € > 0. 
Then there exists a finite set F' such that dy(A, F) < «. Thus AC N,(F), 
and A is totally bounded. 


The mapping iy : x > {x} : (X,d) > (Con(X),dy) is an isometry; 
further, Cony(X) = i(X) is a closed subset of (xy,dy), since if A € 
Con(X) has two distinct points a and a’ then 


e(A, {b}) > max(d(a, b), d(a’,b)) > d(a,a’)/2, 


by the triangle inequality. Note also that if Y is a closed subset of (X, d) 
then the natural inclusion: (Con(Y), dz) — (Con(X), dz) is an isometry. 
Properties of (X,d) are reflected in properties of (Con(X), dz). 


Theorem 15.6.3 Suppose that (X,d) is a bounded metric space. The 
following are equivalent. 

(i) (X,d) is totally bounded. 

(it) (Con(X), dz) is totally bounded. 

(itt) (Con(X), dz) is separable. 


Proof Suppose that (X,d) is totally bounded, and suppose that € > 0. 
Then there exists a finite subset F’ of X such that X = N,(F’). Suppose 
that A € Con(X). Let Fy = {x € F: N(x) NA FO}. Then F'4 is not 
empty, and dy(A,F4) < €. Thus if C(F) is the collection of the 2!”! — 1 
non-empty subsets of Ff’ then Con(X) = N,.(C(F)). Thus (i) implies (ii). 
Since (X,d) is isometric to a subset of (Con(X), dzz), (ii) implies (i). 

Since a totally bounded metric space is separable, (ii) implies (iii). Sup- 
pose that (X,d) is not totally bounded. Then, as in Theorem 15.3.7, there 
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exists € >0 such that there is no finite e-net in (X,d), and there therefore 
exists an infinite sequence (r,,)°2, such that d(tm,%n) > € for m 4 n. 
Let S={xr,:nE€N}, with the subspace metric. Then any subset of S is 
closed in (X,d), and if A and B are distinct non-empty subsets of S then 
dy(A, B) > e. Since there are uncountably many such sets, (Con(X), dz) is 
not separable. Thus (iii) implies (i). 


Theorem 15.6.4 If (X,d) is a metric space, (Con(X), dz) is complete if 
and only if (X,d) is complete. 


Proof The condition is necessary, since (X,d) is isometric to a closed 
metric subspace of (Con(X),dz). Suppose that (X,d) is complete, 
and that (A,)°2, is a Cauchy sequence in (Con(X),dyz). By Propo- 
sition 14.1.1, it is enough to show that (A,)°2, has a convergent 
subsequence. There exists a subsequence (By)?2)=(An,)?2, such that 
di (Br, Beri) < 1/2*, for k € N. We shall show that the sequence (By)%, 
converges. 

Let B= NR2,Mo/ox(Br), where M.(A) = {x € X : d(x, A) < €}. Since 
the mapping x — d(x, A) is continuous, M,(A) is closed, and so B is closed. 
Further, B C N3/9«(B), for k ¢ N. We show that B is non-empty, and that 
Br © N3/9x(B), for each k € N. Suppose that k € N and that x, € Bg. 
First, for 1 <1 < k there exists 2, € By with d(x), 2,) < 2/2!. Secondly, an 
inductive argument shows that for each | > k there exists x7; € B; such that 
d(aj_1,2;) <1/2'-1. Ifk <I <m then 


m m 


2 
d(zj,tm) < > d(zj-1,2;) < », 37-1 < a’ 


j=l+1 j=l41 
so that (a ;)?2, is a Cauchy sequence in (X,d). Let « = limj_,.. 2. Then 
d(x ,,x) < 2/2! for 1 > k, and 
d(aj,x) < d(ay, 2p) + d(ap, x) < 1/2' + 2/2* < 2/2! 


for 1 <1 <k. Thus x € B, and so B is not empty. Further, x, € N3/9x(x) C 
N3/x(B); this holds for any x, € By, and so By C N3/9x(B). Hence 
dy(B, By) < 3/2", and so By > B as k > oo. 


Combining this with Theorem 15.6.3, we have the following. 


Corollary 15.6.5 If (X,d) is a metric space, (Con(X),dx) is compact if 
and only if (X,d) is compact. 
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Exercises 


Suppose that (X,d) is a metric space. 

(a) Let Dp(X) = U%_,Con;(X). Show that D,(X) is closed in 
(Con(X), dz) and give an example to show that Con,(X) need 
not be closed. 

(b) Let p be a product metric on the product X". Show that the 
mapping s : (X", p) + (Con(X), dz) defined by 
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is continuous, and that s(X”) = D,(X). What is s~!(C,(X))?. 

(c) Define a relation on X” by setting « ~ y if there is a permu- 
tation o € Np, such that y; = Le(j) for 1 < 7 <n. Show that 
this is an equivalence relation. Let W,,(X) be the quotient space 
of equivalence classes; W,,(X) is the n-point weighted configu- 
ration space of X. Define a function dw on W,,(X) x W,,(X) 
by setting dyw(a,b) = inf{p(z,y) : « € a,y © b}. Show that 
this is a metric on W,,(X), and that the quotient mapping 
q: (X",p) > (W(X), dw) is continuous. 

(d) Show that there is a unique mapping s : W,,(x) > D,(X) such 
that s = $oq, and show that 5: (W,(z),dw) > (D,(X), dz) is 
continuous. 

Suppose that (X,d) is a complete metric space and that F is a 

finite set of contraction mappings of (X,d). If A € Con(X), let 

U(A) =U{T(A) : T € F}. Show that U is a contraction mapping of 

(Con(X), dz). 

Show that there is a unique A € Con(X) for which A = U{T(A) : 
T € F}, and that if B € Con(X) then U"(B) — Aas n- ow. Ais 
called the attractor of F. 

Let X = [0,1], Suppose that 0 < a, 8 < 1. If € [0,1], let La(x) = 
ax and let Rg(x) = 1— Bx. What is the attractor of {La, Rg} when 
a@ = 8 = 1/3? What is the attractor when a = 8 = 0? What is 
the attractor when 0 < a+ 6 < 1? What is the attractor when 
1l<a+8 <2? 


15.7 Locally compact topological spaces 


The Bolzano—Weierstrass and Heine—Borel theorems are useful in proving 


results in analysis on R, even though R is not compact. Every point in R 


has a compact neighbourhood, and R is the union of an increasing sequence 
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([—n,n])°2, of compact sets. We can consider topological spaces for which 
similar phenomena hold. A topological space space (X, d) is said to be locally 
compact if each point « of X has a base of neighbourhoods consisting of 
compact sets: if U is open and x € U then there exist an open set V and 
a compact set K such that c € V C K CU. Thus R is locally compact. 
So are R@ and C2, with product metrics. More generally, if (X,,d,) and 
(X2,d2) are locally compact, then so is X; x X2, with the product topology. 
A compact Hausdorff topological space is locally compact. Any set X, with 
the discrete topology, is locally compact. 
As usual, we shall concentrate attention on Hausdorff spaces. 


Proposition 15.7.1 A Hausdorff topological space (X,T) is locally com- 
pact if and only if each point has a compact neighbourhood. 


Proof The condition is certainly necessary. Suppose that it is satisfied, 
that « € X and that K is a compact neighbourhood of x. Let V be the 
interior of K, so that  € V C K. Suppose that U is an open neighbourhood 
of x in X. Then UMV is an open neighbourhood of x in K, for the subspace 
topology. Since K is compact and Hausdorff, it is normal, and so there exist 
a subset W of K, open in the subspace topology, and a subset L of K, closed 
in the subspace topology, such that x € W C L CUNV. Since K is compact 
and Hausdorff, L is compact. Since W is open in K there exists an open 
subset Y of X such that W=YOK. But thenW CYNVCYNK=W, 
so that W = Y 7V is open in X and x € W CL CU; (X,7) is locally 
compact. 


Corollary 15.7.2 Suppose that Y is a subspace of a locally compact 
Hausdorff topological space (X,T). 

(i) If Y is closed, then Y is locally compact. 

(ii) If Y is open, then Y is locally compact. 


Proof (i) If y € Y and M is a compact neighbourhood of y in X, then 
MY is a compact neighbourhood of y in Y. 

(ii) If y € Y then Y is an open neighbourhood of y in X, and so there 
exists a compact neighbourhood K of yin X with K CY. Then K = KNY 
is a compact neighbourhood of y in Y. 


Proposition 15.7.3 If (X,7) is a Hausdorff topological space and Y is a 
dense subspace of X which is locally compact in the subspace topology then 
Y is an open subset of X. 


Proof Suppose that y € Y. There exists an open subset U of Y and a 
compact subset K of Y such that ye U C K. Let W = X \ K. There exists 
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an open subset V of X such that U = VNY. We show that VW is empty. 
Since K is compact and X is Hausdorff, kK is closed in X, and soV NW is 
open in X. Suppose that V 1 W is not empty. Since Y is dense in X, there 
exists z €E YN(VNW) =UNW. But UNW C KNW =9. Thus VW = 9, 
so that V C kK. Thus U = VK =V, and U is open in X. Thus Y is open 
in X. 


Theorem 15.7.4 Suppose that (X,7) is a topological space. There exists 
a compact topological space space (X9,Too), @ point Loo of Xoo (the point 
at infinity) and a homeomorphism j of (X,T) onto the subspace Xo, \ {Loo} 
of Xoo. 


Proof We adjoin a point x to X, and set X~ = X U{xX}. We define a 
topology To, on Xo by saying that U € 7 if 


e UNX €7, and 
e if Zoo € U then X \ U is compact. 


Let us verify that this defines a topology. First, @ and Xoo are in Too. 
Suppose that U,,U2 © T.. Then Uj; NU2N X € 7; if Te € U, NU then 
Loo € Uy and rq € U2, so that X \ Uy and X \ U2 are compact, and 
X \ (U,NU2) = (X \ U1) U(X \ U2) is compact. Thus finite intersections of 
sets iN Too are iN Tog. Finally suppose that U C 7. Let W = UyeyU. Then 
WX =VUyer(UNX) € Tr. If to € W then there exists Up € U such that 
Loo € Up. Then X\W = (X \Uo)N(Nueu(X \U)). Since X \ Up is compact 
and Nuvey(X \U) is closed, X \W is compact. Thus arbitrary unions of sets 
IN Too aFe IN Tog, aNd Tog is a topology. 

Next we show that (Xo, 70) is compact. Suppose that U/ is an open cover 
of X... There exists Up € U such that rz. € Up, so that X \ Up is compact. 
Finitely many elements U;,...,U, of U cover X \ Uo, and so Up, U1,...Un 
cover X. 

The natural inclusion mapping j : (X,T) > (Xo, Too) clearly defines a 


homeomorphism of (X,7) onto the subspace Xo \ {Xoo}. 


A compactification of a topological space (X,7) is a compact topological 
space (x ,T), together with a homeomorphism j of (X,7) onto a dense sub- 
space of (X, T). The space (Xoo, To) is called the one-point compactification 
of (X,7). 

The one point compactification is only really useful when it is Hausdorff. 
When does this happen? 
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Theorem 15.7.5 The one point compactification (Xoo,Too) of a topolog- 
ical space (X,7) is Hausdorff if and only if (X,7) is locally compact and 
Hausdorff. 


Proof If (Xo0,To) is Hausdorff, then (X,7) is Hausdorff. Further, 
(Xoo, Too) is regular, and so is locally compact. If « € X then a subset 
N of X is a neighbourhood of x in X if and only if it is a neighbourhood of 
x in Xoo, and so (X,7T) is locally compact. 

Conversely, suppose that (X,7) is locally compact and Hausdorff. First, 
suppose that x and y are distinct points of X. Then, since (X, rT) is Hausdorff, 
then there are disjoint open sets U and V in 7 such that a € U andy € V. 
Since U,V € To, they separate x and y in (Xoo, Too). Secondly, suppose that 
x € X. Since (X,T) is locally compact, there exist an open set W and a 
compact set K in (X,7) such that « © W C K. Since (X,7) is Hausdorff, 
K is closed, and so Xx \ K € Too. Thus W and X, \ K separate x and 


Beg: 


What can we say about the compact subsets of a locally compact space? 


Proposition 15.7.6 Suppose that K is a compact subset of a locally com- 
pact space (X,7T). Then there exists an open set U and a compact set L such 
that K CUCL. 


Proof For each x € K there exists an open set U, and a compact set Lz 
such that x € U, C Ly. Then {U, : x € K} is an open cover of K, which 
has a finite subcover {U, : 2 € F'}. Take U = UgerUyz and L = UserLyz. 


A topological space (X,7) is o-compact if there is a sequence (K’,)°7°., of 
compact subsets of X such that X = U?2, Kn. If so, let J, = UF 3; then 
(Jn)°2, is an increasing sequence of compact subsets of X whose union is X. 


Corollary 15.7.7 If (X,7) is a o-compact locally compact topological 
space then X = UP2, In, where (Ln)°, is an increasing sequence of com- 
pact sets, with Ly, contained in the interior L),, of Lnyi, forn EN. If kK 
is a compact subset of X then K C Ly, for somen EN. 


oe) 


Proof We construct the sequence (L,,)°°, inductively. Suppose that X = 
Upe_, Kn, where the sets K,, are compact. Set Ly = K,. Having constructed 
Ly there is an open set U,41 and a compact subset D,41 such that D,UKy, C 
Un41 © Ln41. If K is compact, it is covered by the increasing sequence 
(L? )c2_, of open sets, so that K C L° C Lp, for some n. 


n=1 
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A sequence (L,,)°°, of compact subsets of a o-compact locally com- 
pact space (X,7) which satisfies the conclusions of this corollary is called a 
fundamental sequence of compact sets. 

What can we say about metrizable locally compact spaces? 


Theorem 15.7.8 Suppose that (X,rT) is a metrizable locally compact 
topological space. The following are equivalent. 

(i) (X,7) is separable. 

(it) (X, 7) is o-compact. 

(itt) (X,7) is second countable. 

(iv) The one-point compactification (Xo,Too) of (X,T) is metrizable. 

(v) (X,T) is homeomorphic to an open subset of a compact metric space. 


Proof Let d be a metric on X which defines the topology rT. 
Suppose that (i) holds. Let (x,)°°, be a dense sequence of elements of X. 
For each r let 


Cp ={n EN: Mij_(ar) = {x € X : d(x, ry) < 1/n} is compact}. 


We shall show that X = U%, (Unec,Mi jn (ar), so that (ii) holds. Suppose 
that x ¢ X. There exists n such that M/,,(x) is compact. There exists 2, 
such that d(z,x,) < 1/2n. Then Mj/9,(x-) C My/n(x), and so 2n € C,. 
Since  € My jon (tr), X = US2y (Unco, Mijn(r)). 

Suppose that (ii) holds. By Corollary 15.7.7, there exists an increasing 
sequence of compact sets (K,) with Ky, C Kp., © Kn41, for which X = 
Ure, K,. Each K,, is second countable, and so therefore is each Ky. Let U;, 
be a countable basis for the topology of K>. Suppose that U is a non-empty 
open subset of X. Then UM Kf is the union of countably many subsets 
in U,, and so U = UP, (UN Ke) is the union of countably many sets in 
u=U"_U,; U is a countable basis for the topology of (X,7). Thus (iii) 
holds. 

Since a second countable space is separable, (i), (ii) and (iii) are equiva- 
lent. Suppose that they hold. Let the sequence (K,,)°2, and U be as in the 
previous paragraph. Then UU {Xx \ Ky: n € N} is a countable basis for 
the topology To. Thus (X,7 0) is metrizable, by Urysohn’s metrization 
theorem: (iv) holds. 

Since (X,7) is homeomorphic to an open subset of (Xoo, Too), (iv) implies 
(v), and (v) implies (i), since a compact metric space is separable (Corollary 
15.4.3), and a subspace of a separable metric space is separable (Corollary 
13.5.4). 
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Exercises 


15.7.1 Construct a two-point compactification of R, and a compactification 
of R? which is homeomorphic to the closed disc {(x,y) € R? : 2? + 
y? <1}. 

15.7.2 Give an example of a countable product of locally compact spaces 
which is not locally compact. 

15.7.3 Suppose that (X,d) is a separable metric space. Show that (X,d) is 
locally compact if and only if there is continuous mapping of C \ {0} 
(where C’ is Cantor’s ternary set) onto X. 

15.7.4 Let X = {x € Hg : supjen |z;/j| < 1}, with the subspace metric. 
Show that X is the union of an increasing sequence of compact sets, 
but is not locally compact. 

15.7.5 Show that the intersection of two locally compact topological sub- 
spaces of a topological space is locally compact. 

15.7.6 Give an example of two locally compact subsets of R whose union is 
not locally compact. 

15.7.7 Give an example of a separable locally compact metric space which 
is not complete. 

15.7.8 Show that if (X,d) is a separable locally compact metric space then 
there is an equivalent metric p on X such that (X, ) is complete. 

15.7.9 Suppose that (X,d) is a separable locally compact metric space. Use 
the metric d to construct a metric d,, on the one-point compactifica- 
tion Xo. which defines the topology 7., avoiding the use of Urysohn’s 
metrization theorem. 
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Suppose that (X,7) is a a-compact locally compact topological space, and 
that (E, ||.||) is a Banach space. Let C(X, E) denote the space of continuous 
functions on X taking values in F. (An important example is the case where 
X is an open subset of the complex plane C, and E = C, and it is good 
to keep this example in mind.) Functions in C(X, F) need not be bounded, 
and uniform convergence is too strong for many purposes. If (f,)72, is a 
sequence in C(X,E) and f € C(X, E) then we say that f;, converges to f 
uniformly on compact sets, or locally uniformly, as k — oo if f,(x) > f(x) 
uniformly on K, for each compact subset K of X. Let us show that this 
convergence can be defined by a suitable metric topology on C(X, F). Let 


(Ly,)°2, be a fundamental sequence of compact subsets of X. 
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If K is a compact subset of X, let me : C(X,E) — C(K,E) be the 
restriction mapping (so that mx(f) = fix), and let tm = 71,. If m <n let 
Tm,n be the restriction mapping from C(Lp, FE) to C(Lm, E). If f € C(X, E), 
let w(f) = (tn(f))921; 7 is an injective linear mapping of C(X, E) into the 
product space [[°°., C(Ln,E). We give each space C(L,, E) its uniform 
norm, and we give the product [[72., C(Ln,E) the product topology r, 
defined by a suitable complete product metric p. 


Proposition 15.8.1 With the notation above, m(C(X,E)) is a closed 
linear subspace of [P< C(Ln, E). 


Proof The mapping 7 is linear, and so 7(C(X, E£)) is a linear subspace of 
[Tp C(En, E). If m < p then 


Gmp = {(g,h) € C(Lm, E) x C(Lp, E) : tm p(h) = g} 
is the graph of the continuous mapping 7), and is therefore a closed linear 


subspace of C'(Lm, E) x C(Lp, E). Consequently 


Emn,p = {(9n)ma=1 € II C(Ln, E) : Timp Gp) _ Om} 


n=1 


is a closed linear subspace of [[?°_, C(Ln, E). Since 


™(C(X, E)) = Oar (Ap>mHm,p) 5 


m(C(X, E)) is a closed subset of [[?°., C(Ln, E). 


Let us set d(f,g) = p(z(f),7(g)). Then d is a complete metric on C(X, E); 
for example, as in Section 13.3, we can take 


7 weet IF) = 9(@)ll_ 
US,9) = > a+ supicz, [F@) —a@) 


or d(f,g) = >> min(2™, sup || f(x) — 9(z)|))- 
n=l rely 


We call the corresponding topology on C(K,E) the topology of local 
uniform convergence. 


Proposition 15.8.2 Suppose that (X,T) is a a-compact locally compact 
topological space, and that (E,||.||) is a Banach space. Let (L,)°2, be a 
fundamental sequence of compact subsets of X. Suppose that (fx)724 is a 
sequence in C(X, EF), and that f € C(X, EF). The following are equivalent. 
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(i) fr(x) > f(x) uniformly on compact sets, as k + oo. 

(ii) fx(x) > f(x) uniformly on each Ly, as k + oo. 

(iit) For each x € X there exists a compact neighbourhood N, of x such 
that f,.(x) > f(x) uniformly on Ny. 

(iv) f(a) + f(x) in the topology of local uniform convergence. 


Proof Certainly (i) implies (ii) and (ii) implies (iii), since if ¢ € X then 
there exists n € N such that « € L7,, so that L, is a compact neighbourhood 
of x. If (iii) holds and if K is a compact subset of X, then for each x € K 
there exists a compact neighbourhood N, of x such that fx(xz) > f(x) 
uniformly on N,. The collection of sets {N° : x € K} is then an open cover 
of Kk, and so there is a finite subcover {N2 : « € F’}. Since f, > f uniformly 
on User Nz, it follows that f, — f uniformly on Kk. Thus (iii) implies (i). 
Finally, the properties of the product topology imply that (ii) and (iv) are 
equivalent. 


Theorem 15.8.3 (The general principle of local uniform convergence) 
Suppose that E is a Banach space and that (X,7) is a o-compact locally 
compact topological space. With the terminology as above, if (f)p21 is a 
sequence in C(X,E) then fy converges locally uniformly to a function f in 
C(X, E) if and only if (an (fr))721 is @ Cauchy sequence in (C(Ln, E), ||-\|,5); 
for eachn EN. 


Proof This follows immediately from the facts that convergent sequences 
are Cauchy sequences, and that the spaces (C(Ln, £), ||.||,,) are complete. 


We can characterize compact subsets of C'(X, £). A subset A of C(X, E) 
is locally uniformly bounded if 7x (A) is bounded in (C'(K, £), ||.||,,), for each 
compact subset K of X. 


Theorem 15.8.4 (The local Arzela—Ascoli theorem) Suppose that (X,7) 
is a o-compact locally compact topological space and that d is a complete 
metric on C(X) defining the topology of local uniform convergence. A closed 
subset A of C(X) is compact if and only if it is locally uniformly bounded 
and is equicontinuous at each point of X. 


Proof Of course we use the Arzela~Ascoli theorem. Let (Lp)? be a fun- 
damental sequence of compact subsets of X, and for each n let 7, : C(X) > 
C(Ly) be the restriction mapping. 

If A is compact, then 7,(A) is compact, and is therefore uniformly 
bounded on L,,, for each n € N; thus A is locally uniformly bounded. If 
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x € X and if x € LS then 7,,(A) is equicontinuous in C(L,,). Since x is an 
interior point of L,,, it follows that A is equicontinuous at x. 

Conversely, suppose that the conditions are satisfied. We show that A is 
sequentially compact. Suppose that (f,)72, is a sequence in A. The con- 
ditions imply that 7,,(A) is totally bounded in C(L,), for each n € N. 
A diagonal argument, then show that there exists a subsequence (fx, Vea 
such that (7n(fx;))F21 is a Cauchy sequence in (C(Zn),||-||.), for each 
n €N. Since (C(Ln),|I.||,,) is complete, it therefore follows that, for 
each n € N, (7n( Fes) Goa converges in norm to an element f(”), say, of 
C(Ln). Since f™(x) = f(x) for m < n and x € Lm, there exists 
a function f on X such that f(x) = f(x) for x € Ln, for each 
néN. If « € X then x € L? for some n, from which it follows that 
f € C(X). Thus (fz, am converges locally uniformly to f. Since A is closed, 
fea. 


Exercise 


15.8.1 Suppose that (X, 7) is a o-compact locally compact topological space, 
and that (£,||.||) is a Banach space. Let (L,)°°, be a fundamental 
sequence of compact subsets of X. Suppose that A C C(X, EF). Show 
that the following are equivalent. 

(i) A is locally uniformly bounded. 
(ii) 7(A) is bounded in C(Ly, E),||.||,,), for each n € N. 
(iii) For each « € X there exists a compact neighbourhood N, of x 
such that my,(A) is bounded in C(Nz, £), ||-||,,)- 
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Suppose that (£,||.||) is a normed space. When is E locally compact? Since 
the mappings ¢ > r+a:EF—> Eandz > Ax: E > E (A ¥ 0) are 
homeomorphisms, (£, ||.||) is locally compact if and only if the closed unit 
ball Bz = {x € E: ||z|| < 1} is compact. 

The space R%, with its usual Euclidean norm, is locally compact. We also 
have the following easy result. 


Proposition 15.9.1 Any linear operator T from R4, with its usual 
Euclidean norm, into a normed space (E,||.||) is continuous. 
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Proof If x = (#1,...,%q@) = 71e1 + -:-xgeq € R? then, using the triangle 


inequality and the Cauchy—Schwarz inequality, 


\[Z'(x) || = |la1T'(e1) +--+ eaT'(ea)|| < [aa]. Per) || +--+ + |xal- lP'(ea) 
1/2 / , 1/2 


d 
So ITesI? yer =K|z\l,, 
j=l 


j=l 


IA 


where K = omen IIT (e;) 7)”. 


Theorem 15.9.2 Any two norms on a finite-dimensional normed space 
E are equivalent. 


Proof Since a finite-dimensional complex normed space of dimension d 
can be considered as a real normed space of dimension 2d, we can suppose 
without loss of generality that FE is a real vector space. 

Let (fi,...,;fa) be a basis for E. Define a bijective linear mapping 
T : R¢ 3 E by setting T(z) = ae xjf;, and define ||T(x)||, = ||], 
where ||.|| is the Euclidean norm on R%. Then |j.||, is a norm on £, 
and T is an isometry of R®% onto (E£,||.||,). Since R% is locally com- 
pact, and complete, so also is (£,||.||,). It is sufficient to show that any 
norm |j.||, on EF is equivalent to |].||,. By Proposition 15.9.1, the iden- 
tity mapping (£,||.||,) — (£,]].||,) is continuous. Thus the function f : 
(E,||.||,) + R defined by f(x) = |x|, is continuous on (£,||.||,). The set 
S= {a € E: |x|], = 1} is compact, and so f attains its minimum on S at a 
point so of S. Since so 4 0,m = f(so) > 0. Thus if ||z||, = 1 then |/a||, > m. 
We now use a standard homogeneity argument. If x € E and x ¥ 0, then 
x/ \|x||, € S, and so 
«|| _ ely 
ltllol [elle 


Thus ||z||, < ||z||, /m for all « € E, so that the identity mapping (£, |].||,) > 
(E, ||.||,) is continuous. 


>m. 


Corollary 15.9.3 (i) Any finite-dimensional normed space is locally 
compact, and complete. 

(it) Any linear operator from a finite-dimensional normed space into a 
normed space is continuous. 

(iit) Any finite-dimensional subspace of a normed space (E,_||.||) is closed 
in E. 
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Theorem 15.9.4 Any locally compact normed space (F,||.||) is finite- 
dimensional. 


Proof Since Bg is compact, it is totally bounded. There therefore exists 
a finite subset F’ of Br such that 


BeCF+ Bp/2={f +a: f €F,|2||< 172}. 


Let G be the linear span of F’. G is a finite-dimensional linear subspace 
of E; we shall show that G = EF. Now Bg C G+ Bg/2, and so Bg C 
G+(G+ Bg/2)/2=G+ Bz/4. Iterating this argument, we see that Br C 
G+ Br/2”, forn €N. Thus if x € Bg andn EN, there exists g, € G such 
that ||a~ — g,|| < 1/2”. Hence g, — x as n — ov, and so, since G is closed in 
E, x €G=G. Consequently, Be C G, and so E = span(Br) CG. 


Exercises 


15.9.1 Suppose that F’ is a proper closed linear subspace of a normed space 
(E, ||-||). Suppose that y € E'\ F and that 0 < « < 1. There exists 
f € F such that |ly — f|| < d(y, F)/(1—). Let x = (y— f)/|ly — fll. 
Show that |||] = 1 and that d(z, F) > 1-e. 

15.9.2 Suppose that F is a finite-dimensional linear subspace of a normed 
space (F,||.||) and that y € EF \ F’. Show that there exists f € F 
such that ||y—f|| = d(y,F'). Show that there exists x € EF with 
||2|| = d(x, F) = 1. 

15.9.3 Suppose that (£,||.||) is an infinite-dimensional normed space. Show 
that there exists a sequence (%,)°°, of unit vectors in E with 
|v; —vz|| > 1 for j,k € N with 7 # k. Give another proof that 
(E, ||.||) is not locally compact. 

15.9.4 If @ € log, let Of) = oF tn /2”, and let H = {2 El, :d(z) = 1}. 
Let Hp = HM cp. 

(a) Show that ¢:1.. > R is a continuous linear mapping. 

(b) Show that H is closed in I. 

(c) Show that there exists a unique h € H such that d(0,h) 
doo(0, H). 

(d) Show that Hp is closed in cp. 

(e) Show that there is no element ho of Ho such that d.(0,ho) = 
dx (0, Ho). 

15.9.5 Suppose that F is a closed linear subspace of a normed space (£, ||.||) 
and that y € E \ F. Let G = span(F,y). Suppose that z € G and 
that fn + Anb > z as n > oo. Suppose that (An)°, does not con- 
verge. Show that there is € > 0 and a subsequence (Ap, )?2., such that 
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|Areei —An,| = €. Show that (an,,,—@n,)/(An,g—Anigi) 2 Yas 2 > 00, 
giving a contradiction. Show that if A, > A as n > oo then z € G. 
Deduce that G is closed. Deduce that if D is a finite-dimensional 
subspace of F then F'+ D is closed. 


16 


Connectedness 


16.1 Connectedness 


In Section 5.3 of Volume I we introduced the notion of connectedness of sub- 
sets of the real line, and showed that a non-empty subset of R. is connected if 
and only if it is an interval. The notion extends easily to topological spaces. 
A topological space splits if X = FU Fo, where F) and F» are disjoint non- 
empty closed subsets of (X,7). The decomposition X = FU F> is a splitting 
of X. If X does not split, it is connected. A subset A of (X,7) is connected if 
it is connected as a topological subspace of (X,7). If X = F) U F is a split- 
ting, then Fy = C(F)) and Fy = C(F)) are open sets, and so X is the disjoint 
union of two non-empty sets which are both open and closed; conversely if 
U is a non-empty proper open and closed subset of X, X =UU(X \U) is 
a splitting of (X,7). Thus (X,7) is connected if and only if X and 0) are the 
only subsets of X which are both open and closed. 


Proposition 16.1.1 Suppose that A is a connected subset of a topological 
space (X,T) and that X = F\ U F» is a splitting of X. Then either A C F 
or AC Fy. 


Proof The sets AN F, and AN F> are disjoint open and closed subsets of 
A whose union is A; one of them must be empty and the other one be equal 
to A. 


Proposition 16.1.2 Suppose that (X,r) and (Y,o) are topological spaces 
and that f : (X,T) + (Y,c) is continuous. If (X,7) is connected, then so is 
the topological subspace f(X) of Y. 


Proof If f(X) = F\UF) is a splitting of f(X) then X = f7'(F,)Uf7!(F) 
is a splitting of X. 
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Connectedness can be characterized in terms of continuous mappings. 
Let D2 be the two point set {0,1}, with the discrete topology. Then D2 = 
{0} U {1} is a splitting of Do. 


Corollary 16.1.3 A topological space (X,T) is connected if and only if 
there is no continuous surjective mapping of (X,T) onto Do. 


Proof The proposition shows that the condition is necessary. On the other 
hand, if X = Fo U F; is a splitting of X, and if x € Fj, let f(x) =7. Then f 
is a continuous surjection of X onto Do. 


We also have an intermediate value theorem. 


Corollary 16.1.4 (The intermediate value theorem) Suppose that f : 
(X,7) > R is a continuous real-valued function on a connected topologi- 
cal space (X,T), and that f(x) <u < f(x’) (where x,x' € X). Then there 
exists y © X such that f(y) =v. 


Proof f(X) is a connected subset of R, and so it is an interval. Since 
f(x), f(z’) € f(X) and f(x) <u < f(a’), it follows that v € f(X). 


Let us consider the connected subsets of a topological space (X, Tr) 


Proposition 16.1.5 Suppose that A is a set of connected subsets of a 
topological space (X,T) and that N 4c 4A 4 0. Then B = Use A ts connected. 


Proof Let cbean element of N4¢.4A. Suppose that Fis an open and closed 
subset of B. We must show that either F = B or that F = 0. Suppose first 
that ce F. If Ae A, then £7 A is a non-empty open and closed subset of 
A. Since A is connected, A = FMA C F. Since this holds for all A € A, 
BC F, so that F = B. Secondly, suppose that c ¢ F. If A € A, then 
F(A is a open and closed subset of A which is not equal to A. Since A is 
connected, FM A = @. Since this holds for all A € A, FN B = 9, so that 
P=, 


As an example, consider the open unit ball Ug = {x € E: ||a|| < 1} of 
a normed space. If x € Ug, let f, : [0,1] + Ug be defined by f(t) = ta. 
Then f, is continuous, so that f,([{0,1]) is connected. Since 0 € f,({0, 1]) for 
all x € Ug, the set Ug = U{fz([0, 1]) : c € Uz} is connected. It follows, by 
translation and scaling, that N-(x) is connected, for each x € E and € > 0. 


Proposition 16.1.6 Suppose that A is a dense connected subset of a 
topological space (X,T). Then (X,T) is connected. 
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Proof Suppose that F' is a non-empty open and closed subset of X. Since 
F is open and A is dense in X, FN A#Q. But F'N A is open and closed in 
A, and so 1 A= A, since A is connected. Thus A C F’. Since F is closed 
in X, X =AC F,andso F=X. 


Corollary 16.1.7 Suppose that A is a connected subset of a topological 
space (X,T) and that AC BCA. Then B is connected. 


Proof A is a dense subset of the subspace B of (X,7). 


Theorem 16.1.8 Jf (Xa,Ta)aca is a family of connected topological 
spaces, then the topological product X = ||,<-4(Xa;Ta) is connected. 


Proof Suppose that G is a non-empty open and closed subset of X, and 
that « € G. We shall show that if y © X then y € G, so that G = X, 
and X is connected. Suppose that 6 € A. The cross-section Cz, = {z € 
X : 2% = Lq fora # $} is homeomorphic to (Xg,7g), and is therefore 
connected. GM C;,g is an open and closed subset of Cz, containing x, and 
so GM Cz,3 = Cr,g. In particular, let w“)(8) = yg and w\ (a) = rq for 
a # 6; then w) € G. Iterating this argument, if F is a finite subset of A, 
let w" (8) = yg for 8 € F and w* (a) = zq for a ¢ F. Then w* € G. Now 
suppose that N is a neighbourhood of y. Then there exists a finite subset F’ 
of N such that N D {z € X : zg = yg for 8 € F}. In particular, w EN, 
so that NOG 4 0. Thus y € G. Since G is closed, y € G. 


We now consider the collection of connected subsets of a topological space 
(X,7). We define a relation on X by setting a ~ 0 if there is a connected 
subset A of (X,7) which contains a and b. This is clearly symmetric and 
reflexive (take A = {a}), and is also transitive, by Proposition 16.1.5, and 
so it is an equivalence relation on X. The equivalence classes are called the 
connected components of (X,T). 


Proposition 16.1.9 Suppose that E is a connected component of a 
topological space (X,7), and that x € E. Let 


G, =U{F CX :2€ F and F is connected}. 


Then EF = Gy. 


Proof If y © EF then there exists a connected subset Ff’ of X such that 
{z,y} C F, and so y € G,; thus E C G,. On the other hand, G, is 
connected, by Proposition 16.1.5, and so G, C E. 


Corollary 16.1.10 A connected component E of a topological space (X,7T) 
is closed and connected, and is a maximal connected subset of (X,T). 
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Proof  E is connected, since G, is connected. Since E is connected, E C 
G, = E, and so E is closed. If F' is a connected subset containing E then 
F CG,, and so F = E; E is a maximal connected subset of (X,7). 


Corollary 16.1.11 Jf X =GUH is a splitting of X, then either E CG 
or E CH. 


Example 16.1.12 Two distinct connected components of a Hausdorff 
topological space which cannot be separated by a splitting. 


We construct an example in R?. For n € N let C,, be the circle {(z, y) : 
r+y?=n/(n+1)}, let W =U%,C,, and let 


X = {(1,0)} U {(-1,0)} UW. 


Then the circles C;, are connected components in X, and so therefore are the 
singleton sets {(1,0)} and {(—1,0)}. Suppose that G is an open and closed 
subset of W which contains (1,0). Since G is open, there exists no such that 
(n/(n+1,0)) € G for n > no. Since each C;, is connected, C;, C G for n > no, 
and so (—n/(n +1),0) € G for n > no. Since G is closed, (—1,0) € G. Thus 
if F, U F> is a splitting of X, the two connected components {(1,0)} and 
{(—1,0)} must either both be in F;, or both be in F. 

Things work better for compact Hausdorff spaces. We need a preliminary 
lemma. 


Lemma 16.1.13 Suppose that G is a set of open and closed subsets of a 
compact Hausdorff space (X,7), and that J = NgegG. If J,UJ2 is a splitting 
of J, then there is a splitting Fi U Fo of X, with Jy C Fy and Jo C Fo. 


Proof The sets J, and J2 are disjoint closed subsets of X, and (X,7T) is 
normal, and so there exist disjoint open subsets U, and U2 of X with J; C U4 
and Jg C U2. The open sets Uj, Uz and {X \ G;G € G} cover X, and so 
there is a finite subcover U;, U2, X \Gj,...,X \Gy. Let Fy = UN (M1G;) 
and Fh = U2U( yae.¢ \G;)). Then Fi, U Fy = X, Fi and F» are open and 
disjoint, Ji Cc Fy and Jo C Fy. 


Theorem 16.1.14 Jf C and D are distinct connected components of a 
compact Hausdorff space (X,7T), there exists a splitting GUH with C CG 
and DCH. 


Proof Let G = {G: G is open and closed, and CC G}, and let J = Nceeg 
G. First we show that J is connected. Suppose not, and suppose that J = 
J, U Jo is a splitting of J. By Lemma 16.1.13, there exists a splitting X = 
F, UF of X with J; C Fy and Jo C Fy. But C is connected, and so C C Fy 
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or C' C Fo. Suppose, without loss of generality, that C C Fy. Then Fy € G, 
so that J C Fy. Thus Jp C F, 9 Fh = 0, giving a contradiction. 

Thus J is connected. But C' C J and C is a maximal connected subset of 
X, and so C = J. Suppose that d € D. Then there exists G € G such that 
d ¢ G. Since H = X \ G is open and closed and D is connected, D C H. 
Thus the splitting X = GU H has the required properties. 


When are connected components open? A topological space is locally con- 
nected if for each a € X and each N € N, there exists a connected M € N, 
with M C N;; that is, each point of X has a base of neighbourhoods con- 
sisting of connected sets. For example, a normed space is locally connected, 
since if € E then the sets {N.(x) : « > 0} form a base of neighbourhoods of 
x consisting of connected sets. An open subset of a locally connected space 
is clearly locally connected. 


Proposition 16.1.15 Jf (X,7) is a locally connected topological space then 
the connected components of X are open and closed. 


Proof Suppose that E is a connected component of X. E is closed, by 
Corollary 16.1.10. If a € E, there exists a connected neighbourhood N of a. 
Since E is the maximal connected subset of X containing a, N C E. Thus 
a € E*®. Since this holds for all a € E, E is open. 


Thus the connected components form a partition of X into connected 
open and closed sets. 


Corollary 16.1.16 Jf U is an open subset of a normed space (E,||.||), 
then the connected components of U are open subsets of E. 


Corollary 16.1.17 Jf (X,7) is a compact locally connected topological 
space then there are only finitely many connected components. 


Proof For the connected components of X form an open cover of X. 


Corollary 16.1.18 Jf (X,7) is a separable locally connected topological 
space then there are only countably many connected components. 


Proof Let D be a countable dense subset of X. If d € D let Cq be the 
connected component to which d belongs. If C is a connected component, 
then C'N D is not empty, and so the mapping d — Cy is a surjection from 
the countable set D onto the set of connected components. 


In particular, and this will be very important later, an open subset of C 
is the countable union of disjoint open connected sets. 
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Proposition 16.1.19 Jf (X,r) = []j-1(Xn,7™m) is a finite product of 
locally connected topological spaces then (X,T) is locally connected. 


Proof Ifa €X then the sets 


{N = Ni x--: x N,: Nj; is a connected neighbourhood of «;} 


form a base of neighbourhoods of x consisting of connected sets. 


On the other hand, an infinite product of locally connected topological 
spaces need not be locally connected. If X, = {0,1}, with the discrete 
topology, for all n € N, then X,, is locally connected, while the connected 
components of (X,7) = [][°°,(Xn,™) are the singleton sets. But we have 
the following. 


Proposition 16.1.20 If (X,7) = ]]gc4(Xa; Ta) is a product of connected 
locally connected topological spaces then (X,T) is locally connected. 


Proof This is left as an exercise for the reader. 


Exercises 


16.1.1 A point x in a connected topological space (X,7) is a splitting point 
of X if the topological subspace X \ {x} is not connected. Determine 
the splitting points of the spaces (0,1), (0, 1], [0,1] and [0,1] x [0,1]. 
Show that no two of them are homeomorphic. 

16.1.2 Show that there is no continuous injective map from R? into R. 

16.1.3 Suppose that A is a closed subset of [0,1] x [0,1] for which each 
cross-section AM ({z} x [0,1]) is a non-empty interval. Show that 
there exists 0 < # < 1 such that (2,2) € A. 

16.1.4 Suppose that A is a set of non-empty subsets of a set S. A is linked 
if whenever A, B € A then there exists a finite sequence (Ag, ... An) 
such that Ag = A, A, = B and Aj_1 A; £0 for 1 <j <n. Show 
that if A is a linked set of connected subsets of a topological space 
(X,7) then U4e4A is connected. 

16.1.5 Suppose that (X,d) is a compact metric space for which N-(x) = 
M.(x) for each « € X and € > 0. Show that the sets N.(x) and 
M(x) are connected, for each « € X and € > O. In particular, 
(X,d) is connected and locally connected. 

16.1.6 A metric space (X,d) is well-linked if whenever a,a’ € X and e > 0 
there exists a finite sequence (aj )9 in X with aj = a,an =a’ and 
d(a;1, 04) < efor 1 <7 <n. 

(a) Show that a connected metric space is well-linked. 
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(b) Give an example of a well-linked metric space which is not 
connected. 
(c) Show that a compact well-linked metric space is connected. 
16.1.7 Show that a countable Hausdorff topological space with more than 
one point is not connected. 
16.1.8 Prove Proposition 16.1.20. 
16.1.9 Suppose that the compact metric space (X,d) has the property 
that if z,y € X then there exists z € X with d(z,z) = d(y,z) = 
d(x, y)/2. Show that (X,d) is connected. 

16.1.10 Suppose that (C,)°°., is a decreasing sequence of closed connected 
subsets of a compact topological space. Show that M°2,C), is 
connected. 

16.1.11 Give an example in R? of a decreasing sequence (C;,)°~, of closed 
connected sets whose intersection is not connected. 


16.2 Paths and tracks 


Suppose that (X,7) is a Hausdorff topological space. Let us consider the set 
Cx ([a, b]) of continuous mappings from the closed interval [a,b] into (X,7). 
An element f of Cx({a, 6]) is called a path in X. f(a) is the initial point of 
the path, and f(b) is its final point, and f is a path from a to b. The image 
f(a, 6]) is called the track from f(a) to f(b), and is denoted by [f]. It is a 
compact connected subset of (X,d). It can be helpful, if not mathematical, 
to think of the interval [a,b] as a time interval; we start at time a at f(a); 
f(t) denotes the point of the track that we have reached by time t, and we 
reach f(b) at time b. We may retrace our footsteps, or cross the path at a 
point that we have reached before, and, as we shall see, many other strange 
things can happen. A path is closed if f(a) = f(b); we return to our starting 
point. 

Before going any further, let us give a word of warning: there is consid- 
erable variation in terminology. Different authors use words such as ‘path’, 
‘track’, ‘curve’ and ‘arc’ with a variety of different meanings. 

As a simple example, if a and 6 are elements in a normed space (£, |].||) 
then the linear path o(a,b) : [0,1] > E from a to 6 is defined as o(a, b)(t) = 
(1—t)a+tb, for t € [0,1]; its track is denoted by [a,b], and is called the line 
segment from a to b. 

We can jurtapose two paths to obtain a new path: if f : [a,b] > X and 
g: [c,d] > X are paths, and f(b) = g(c) we define f V g to be the path 
from [a,b + (d —c)] into X defined by f V g(x) = f(x) for x € [a,b] and 
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f V g(x) = g(x + (c— b)) for x € [b,b + (d—c)]. Thus f V g has initial point 
f(a) and final point g(d), and [f V g] = [f] U [g]. 

The juxtaposition of finitely many linear paths is called a piecewise-linear 
or polygonal path. If y = a(vp, v1) V...Vo(vg_1, Vg) is a polygonal path, then 
the points vg, v1,-..,Uz% are called the vertices of y, and the line segments 
[vj-1,v;] are called the edges of y. A polygonal path y in R? is called a 
rectilinear path if its edges are parallel to the axes: that is, if 1 <7 < k then 
all but one of the coordinates of vj;_; and v; are the same. A polygonal path 
in R¢ is called a dyadic path if each coordinate of each vertex is a dyadic 
rational number. Dyadic rectilinear paths will be very useful, since we can 
apply counting arguments to sets of dyadic rectilinear paths. 

We can reverse a path. If f : [a,b] > X is a path, we set f* (t) = f(a+b—t) 
for t € [a,b]. Then f* is a path, the reverse of f, with initial point f(b) and 
final point f(a), and with [f*] = [/]. 

We can define an equivalence relation on paths. If f : [a,b] ~ X and 
g:|c,d| > X are paths, we say that f and g are similar paths, or equivalent 
paths, if there exists a homeomorphism ¢ : [c,d] + [a,b] such that ¢(c) = a, 
o(d) = b and g = f o¢. It is easy to see that this is an equivalence relation. 
Recall that ¢ : [c,d] — [a,b] with ¢(c) = a and $(b) = dis a homeomorphism 
if and only if it is a strictly increasing continuous function. Similar paths 
have the same track, and if g(s) = fo ¢, we can think of t = ¢(s) asa 
change of variables or reparametrization. For example, if f : [a,b] > X is 
a path, and we set g(t) = f((1 —t)a+ tb) for t € [0,1] then g: [0,1] > X 
is a path equivalent to f. Again, if f : [0,1] > X is a path, and we set 
g(t) = f(t?) for t € [0,1] then g : [0,1] — X is a path equivalent to 
f. For many purposes, equivalent paths play the same role, and we shall 
frequently identify equivalent paths. Thus it is often convenient to consider 
paths defined on [0, 1] or [0, 27]. 

We can change the initial point of a closed path. Suppose that f : [a,b] > 
X is a closed path and that s € [a,}]. Define f, : [a,b] + X by setting 


fs(t) = f(s—a+t) fora<t<a+b-s, 
and f,(t) = f(t +s—b) fora+b—s<t<b. 


Then f, is a closed path with initial point f(s) and with the same track as 
f. Two closed paths f : [a,b] > X and g: [c,d] ~ X are similar closed 
paths if there exists s € [a,b] such that f, and g are similar paths. 

A path f : [a,b] > (X,7) is simple if f is an injective mapping from [{a, }| 
into X. A simple path is sometimes called an arc. In this case, since [a, }] 
is compact and (X,7) is Hausdorff, f is a homeomorphism of [a,b] onto the 


472 Connectedness 


track [f]. Suppose that f : [a,b] > X and g: [c,d] > X are simple paths 
with the same track, and with the same initial points and final points. Let 
f-' : [f] - [a, 6] be the inverse mapping, and let y = f~!og. Then y is a 
homeomorphism of [a,b] onto [c,d] with y(a) = c and y(b) = d, and so f 
and g are equivalent. 

A simple closed path f : [a,b] + X is a closed path whose restriction 
to [a,b) is injective. Thus a simple closed path is not a simple path, since 
f(a) = f(b), but otherwise f takes different values at different points of [a, b]. 
For example, the mapping k : [0,27] + R? defined by «(t) = (cost, sint) is a 
simple closed path, with track the unit circle T = {(a,y) € R?: ||(z,y)|| = 
1}. More generally, if w = (a,y) € R? and r > 0, then the circular path 
k,(w) is defined as 


Kr(w)(t) = (a + rcost,y+rsint) for t € [0, 27]; 


we denote its track by T,(w). Suppose that f : [0,27] > X is a simple 
closed path. Let h = fo«~!. Then h is a homeomorphism of T onto [f]; the 
track of a simple closed path is homeomorphic to the unit circle. 

In order to illustrate the nature of paths, let us establish some approxi- 
mation results that we shall need later. 


Theorem 16.2.1 Suppose that y: [0,1] > U is a path from a to b in an 
open subset U of a normed space (E,||.||,;), and that 6 > 0. Then there is a 
polygonal path B :{0,1] > U from a to b with |B(t) — 7(t)| < 6 for t € [0,1]. 


Proof Since [y]| is compact, we can suppose, by taking a smaller value of 
6 if necessary, that N5([y]) = U{Ns(z) : z € [y]} C U. Since ¥ is uniformly 
continuous on [0,1] there exist 0 = to < t) < --: < t, = 1 such that 
ly(t) — y(t3)| < 6/2 for t € [t;-1,t,;], for 1 < j < k. We consider the 
polygonal path with vertices to,...,t,. Let 


B = o(y(to), V(t1)) V o(y(t1), V(t2)) V «oC (te-1), Vte)), 


parametrized so that 8(t;) = y(t;) for 0 < 7 < k. Then [6] C U, since 
[y(tj-1), V(ty)] S No (tj) CU. If tj1 <t < tj then 


FQ) -— Alle < NEO) — Balle + Iya) — Vlg < 4 


Corollary 16.2.2 Jf E = R4%, then 6 can be chosen as a rectilinear path. 


Proof Replace each of the paths o(7(tj;-1), y(t;)) by the juxtaposition of 
finitely many linear paths, each parallel to an axis. 
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Corollary 16.2.3 Suppose that y: [0,1] + U is a path in an open subset 
U of R4 and that 5 > 0. Then there is a dyadic rectilinear path 8 : [0,1] > U 
with |B(t) — y(t)| < 6 for t € [0,1]. 


Proof Approximate the vertices of the path of the previous corollary by 
vertices each of whose coordinates is a dyadic rational number. 


16.3 Path-connectedness 


A topological space (X,7) is path-connected if for each x,y € X there is a 
path from x to y. 


Proposition 16.3.1 Jf (X,7) is a path-connected topological space then 
(X,7) is connected. 


Proof Suppose that F' is a non-empty open and closed subset of X, and 
that « € F. If y © X there exists a path from x to y. Let T be its track. 
Then « € T and T is connected, and so T C F.. But y € T, and so y € F. 
This holds for all y € X, and so F = X; X is connected. 


Example 16.3.2 A connected compact subset of the plane R? which is 
not path-connected. 


Let 


f={(0,0)2—-lL<y7 <1), J={isml/z))s0<2< 1), Ket”, 
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Figure 16.3. A connected set which is not path-connected. 


474 Connectedness 


The mapping (z,sin(1/z)) — «wv is a homeomorphism of J onto (0, 1], 
so that J is connected. K is the closure of J, so that K is compact 
and connected. We show that K is not path-connected. Suppose that 
f : [a,b] — K is a path from (0,0) to (1,sin 1) and set f(t) = (g(t), A(t)). Let 
S = sup{s € [a,}] : g(s) = 0}. By continuity, g(s) = 0, and soa < s < b. 
Using the intermediate value theorem inductively, we can find a decreasing 
sequence (tp,)?°.9 in [s, b] such that g(t,) = 2/m(2n+1). Let T = limp-+oo tn. 
Then A(t,) = sin((2n + 1)a/2) = (—1)", so that h(t) does not converge 
to h(T) as n > oo, contradicting the continuity of h. Note also that J is 
path-connected, but that J = K is not. 

Nevertheless, as we shall see, path connectedness provides a valuable test 
for connectedness. For example, a normed space (F, ||.||) is path-connected, 
and therefore connected: if x,y € F, let f(t) = (1 — t)x + ty, for t € [0,1]. 
Here the track is the line segment [, y]. 

We can also partition a topological space into path-connected compo- 
nents. For this, we need the following. 


Proposition 16.3.3 Suppose that (X,7T) is a topological space. Define a 
relation ~ on X by setting x ~ y if there exists a path in X from x to y. 
Then ~ is an equivalence relation on X. 


Proof Ifa €X, let f(t) =< for x € [0,1]. Then f is a path from z to z, 
and sox~ @. 

If f : [a,b] — X is a path from z to y, the reverse path f* is a path from 
y to x. Thus ifa~y then y~ a. 

If f isa path from x to y and g isa path from y to z then the juxtaposition 


fVgisa path from z > z. Thus ifa#~yandy~ z then r~ z. 


The equivalence classes are called the path-connected components of X. 
They are maximal path-connected subsets of X. Path-connected components 
need not be closed. If K is the example that we have given of a connected 
space that is not path-connected, then the path-connected components are 
I, which is closed, and J, which is not. 

We can also define local path connectedness. A topological space is locally 
path-connected if for each x € X and each M € N, there exists N € N; such 
that N C M and N is path connected; that is, each point of X has a base of 
neighbourhoods consisting of path-connected sets. Of course, a locally path- 
connected space is locally connected, and its path-connected components 
are connected. An open subset U of a normed space (£, ||, ||) is locally path- 
connected; if  € U and N,(x) C U then N,(x) is path-connected. 
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Proposition 16.3.4 Jf (X,7) is a locally path-connected topological space, 
then the path-connected components are open and closed. 


Proof Suppose that EF is a path-connected component of X and that x € 
E. Let N bea path-connected neighbourhood of X. Then x € N° CN CE, 
and so E is open. The complement of FE is the union of path-connected 
components, and therefore it is open. Thus EF is closed. 


Corollary 16.3.5 A locally path-connected space is connected if and only 
if it is path-connected. 


In particular an open subset of a normed space, or of C, is connected if 
and only if it is path-connected. 

Results concerning the path-connectedness of product spaces are easier 
to prove than the corresponding ones for connectedness. 


Theorem 16.3.6 Jf (X,7) = []ye4(Xa,Ta) is the product of path- 
connected topological spaces (Xq,Ta) then (X,7T) is path-connected. 


Proof Suppose that x,y € X. Then for each a € A there exists a path 
fa : [0,1] — X from Zq to Yq. Let f(t) = (fa(t))eea. Then f is a path from 
x to y. 


Corollary 16.3.7 If (X,7r) is a finite product of locally path-connected 
topological spaces then (X,7) is locally path-connected. 


The example (the infinite product of two-point sets) that we gave for local 
connectedness shows that this result does not extend to infinite products. 


Exercises 


16.3.1 Suppose that (X,d) is a path-connected metric space. Show that the 
set F(X) of finite non-empty subsets of X, with the Hausdorff metric, 
is path-connected. 

16.3.2 Suppose that (X,d) is a path-connected locally path-connected com- 
pact metric space. Show that the configuration space C(X), with the 
Hausdorff metric, is path-connected. 


16.4 *Hilbert’s path* 


(This section can be omitted on a first reading.) 


The definition of a path, and its track, are very straightforward and nat- 
ural. Paths are however not at all straightfoward. In 1890, Peano gave an 
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example of a path in the plane whose track is the unit square [0, 1] x [0,1]. 
We shall construct a path, Hilbert’s path, with the same track; this was 
described by Hilbert in 1891. 

We start with the unit square [0,1] x [0,1], which we list as 5, It can be 
divided into a set Q; of four squares with side-length 1/2, namely [0, 1/2] x 
(0, 1/2], (0, 1/2]x[1/2, 1], [1/2, 1] 0, 1/2] and [1/2, 1]x[1/2, 1]. We can divide 
each of these squares into four smaller squares, and iterate the procedure. 
Thus at the nth level we have a set Q,, of 4” squares of side-length 1/2”. 
Note that there is a natural parity on Q,; if S = [(j — 1)/2”,7/2"] x [(k- 
1)/2", k/2"], we say that S has odd parity if j +k is odd, and even parity if 
j +k is even. If we colour the squares with odd parity white, and those with 
even parity black, then we have a checker-board colouring. Note that if two 
squares in Q, share a common side, then they have different parities. 

We shall show that there is a unique listing (see of Q, forn € N 
such that 


(i) zo = (0,0) € 3”) and #1 = (1,0) € si). 
(ii) s and oo share a common side, for 1 < 7 < 4"; 
Ces 7 Us Us Us inter 


for all n € N. Note that if we have such a listing, then ce has odd parity 
if 7 is even, and even parity if 7 is odd. 

To show this, note, by considering possible cases, that if we divide a square 
S into a set QO of four squares with equal side-length, and if T € Q and € is 
a side of S, then there is a unique listing (Ti )fa1 of Q so that consecutive 
terms have a common side and so that T; = T and T;nNe #0. 

We begin by listing Q1: we set 


gs = [0, 1/2] x [0, 1/2], 
gs = [0, 1/2] x [1/2, 1], 
si = [1/2, 1] x [1/2, 1], 
st = [1/2,1] x [0, 1/2]. 


This satisfies the conditions, and is the only listing that does so. . 
nm 


Suppose that Q, has been listed. Let an be the common side of Si 
and Se for 1 < j < 4”. We list the four elements of Q,4,1 contained 


in 3”) so that consecutive terms have a common side and so that x9 € 
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git) and srr) al el”) # (); this listing is unique. We then set get) 


to be the element of 3) which has a side in common with Ca and 
iterate. We continue in this way until we reach Crees which is contained 


in gi”). up to here the listing is unique. Now Sons has even parity, and 


[1 —1/2"*1, 1] x [0,1/2"*1], the element of Q,4 to which x1 belongs, has 
odd parity, and so we can complete the listing to satisfy the conditions, and 
in a unique way. 

We now use these listings to define approximations h,, to Hilbert’s path. 
Let a”) = £0, let ri be the centre of the square s” for 1 <j < 4” and 


let of) = x1. Let 
to = 0, let t; = (jg —1/2)/4” for 1 <7 <4”, and let tan4, = 1. 


Then set 

fon((1 — A)ty + Atj4a) = (1 Aah” + As, 
for0 <A <1land0 <j < 4”. Then h, is a simple path from 29 to x1 
which spends equal time 1/4” in each of the squares of Q, in turn. Because 
of condition (iii), if (j — 1)/4" < t < j/4” then h,(t) € S\”) and also 


hm(t) € Che for allm > n. 


(0,1)! 


(0,0) (1,0) 


Figure 16.4. The paths Hz and Hy. 
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Thus ||hn(t) —hn(t)|lp < V2/2". Since this holds for all t € [0,1], 
[Pn —hmllog < V2/2", and (Rn)&1 is a Cauchy sequence in C2((0, 1). 
Since CR2([0,1]) is a Banach space, the uniform limit h is a path from 2 
to 21. We must show that [h] = [0,1] x [0,1]. If not, then, since [A] is a 
closed subset of [0,1] x [0,1], its complement is a non-empty open subset of 


) 


[0,1] x [0,1], and this contains a closed square che , for some j and n. But 


if (j —1)/4" <t < 5/4” then Am(t) € S”) for m > n. As hm(t) > h(t) as 
m —> oo, it follows that h(t) € o.. giving a contradiction. 

Hilbert’s path has a great deal of self-similarity. For example, the mapping 
c: (x,y) > (y/2,x2/2) maps (0, 1] x [0, 1] onto [0, 1/2] x [0, 1/2], and c(f(t)) = 
f(t/4) for 0 < ¢ < 1. Similarly if b(z,y) = (#/2, (y+ 1)/2) then b(f(t)) = 
f((t+0)/4). 

The examples of Peano and Hilbert overturned many intuitions about 
dimension: a two-dimensional object can be the continuous image of a one- 
dimensional one. But the consequence of this was the development of a 
rich theory of dimension. One other conclusion that should be drawn from 
this example is that the notion of continuity is not a straightforward one; 
the sketches that we make when we consider continuous functions are quite 
untypical of what can happen. 

Hilbert’s path is interesting because of its explicit construction and its self- 
similarity properties. In the next section, we shall establish a more general 
result. 


Exercise 


16.4.1 Construct a Hilbert path in R?, and in R@ for dé N. 


16.5 *More space-filling paths* 


(This section can be omitted on a first reading.) 


We now show that many compact metric spaces are the tracks of 
continuous paths. 

We can express the local path-connectedness of a compact metric space 
in a uniform way. 


Theorem 16.5.1 Suppose that (X,d) is a compact metric space. Then 
(X,d) is locally path-connected if and only if, given € > 0, there exists n > 0 
such that if d(x,y) < 1 there exists a path f : [0,1] > X from x to y such 
that d( f(s), f(t)) <¢ forO<s<t<l. 
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Proof The proof is a standard compactness proof. Suppose that (X, d) 
is locally path-connected and that « > 0. For each a € X there exists 
6(a) > 0 and a path-connected set Cy such that N5(q)(@) © Ca © Ne/2(2). 
Thus if b,c € Ngq)(a) then there is a path f : [0,1] + X from 6 to ¢ 
with d(f(s), f(t)) < € for 0 < s < t < 1. The sets No(qy/2(a) form an 
open cover of X, and so there is a finite subcover {N5(q)/2(@) : a € F}. Let 
n = min{d(a)/2: a € F}. If d(x, y) < 7 then there exists a € F such that 
x € N5q)/2(a), and so y € Ngiq)(a). Thus there is a path from x to y with 
the required properties. 

Conversely, suppose that the condition is satisfied, and that N.(x) is a 
neighbourhood of x in X, and let 7 satisfy the condition. Let C.(x) be 
the set of points y in X for which there is a path f : [0,1] ~ X from 
x to y for which d(f(s), f(t)) < € for 0 < s < t < 1. Any such path is 
contained in N,(x) (take s = 0), so that C.(x) C N.(x), and the condition 
implies that N,(x) C C.(x). Since C.(a) is clearly path connected, the sets 
{C.(x) : € > 0} form a base of path-connected neighbourhoods of «. 


Corollary 16.5.2 If (X,d) is a locally path-connected compact metric 
space there exists an increasing real-valued function h on (0,diam X], for 
which h(u) + 0 as u \ 0, such that if x,y € X there exists a path 
f : [0,1] ~ X from « to y for which d(f(s),f()) < h(d(z,y)) for 
O0<s<t<l. 


Proof Ifz,y € X, let P(x,y) be the set of paths from zx to y. Let r(x, y) = 
inf{diam ([p]) : p € P(z,y)}; then d(z,y) < r(z,y) < diam(X). If 0 < 
u < diam(X) let h(u) = 2sup{r(z,y) : d(z,y) < u}. Then the theorem 
implies that h(u) + 0 as u \, 0, and the construction ensures that the other 
requirements of the corollary are satisfied. 


Theorem 16.5.3 Suppose that (X,d) is a locally path-connected compact 
metric space. Then there exists a path in X whose track is X. 


Proof Let h be a function satisfying the conditions of Corollary 16.5.2. 
By Corollary 15.4.9, there exists a continuous surjective mapping f from the 
Cantor set C' onto X. We extend this to a continuous mapping ae [0,1] > X. 
Suppose that (c,d) is a connected component of [0,1] \ C. There is a path 
y:|ce,d] > X from f(c) to f(d) such that 


d(y(s), y(t) S$ h(d(F(©), F(@))) fore <s<t<d. 


We define f(s) = 7(s) for s € (c,d). 7 
The function f maps (0, 1] onto X; it remains to show that f is continuous. 
If s € [0,1] \\C then f is continuous at s, since the path y is continuous at s. 
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We must show that 7 is continuous at each point t of C’. This takes just a 
little care. It is enough to show that fis continuous on the right on [0, 1) and 
continuous on the left on (0, 1], and it is clearly sufficient to show the former. 
If t is the left-hand end-point of a connected component of [0, 1]\C, then fis 
continuous on the right at t. Otherwise, suppose that € > 0. Then there exists 
6 > 0 such that h(u) < €/2 for 0 < u <4. Since f is continuous on C, there 
exists 0 < 7 < 6 such that ¢ +7 < 1 and |f(r) — f(t)| = |f(r) — f()| < €/2 
for r € (t,t +7) NC. Since t is not the left-hand end-point of a connected 
component of [0,1] \ C, there exists t/ € (t,t +7)NC. Ift < s < t/ then 
either s € C, in which case | f(s) — f(t)| < €/2, or s € [0,1]\C. In the latter 
case, there is a connected component (c,d) of [0,1] \ C for which s € (c,d). 


But then (c,d) C (t,t), so that d—c < 6, and |f(s) — f(e)| < €/2. Thus 


| f(s) — f(t)| < € fort <s <1; f is continuous on the right at t. 


Corollary 16.5.4 Suppose that K is a compact convex subset of a normed 
space (E,||.||). Then there exists a path in K whose track is K. 


Proof If ki,k2 € K then the line segment [k1, kz] is contained in AK, and 
so K is locally path-connected. 


16.6 Rectifiable paths 


In Part Five, we turn to the problem of integrating a function along a path. 
We have however seen that paths like Hilbert’s path can behave very badly. 
We must therefore consider a more restricted class of paths. 

The trouble with Hilbert’s path and other space-filling paths is that they 
have infinite length. Let us make this explicit. If y : [a,b] + (X,d) is a path, 
its length I(7) = lja.4)(7) is defined as 


I(y) = sup Se dlig1), 1s) '@=19 < ti <nuty=b, HEN 
j=l 


It is possible that 1, ,)(7) = 00; as a simple example, if 7(0) = (0,0) and 
y(t) = (t,tsin w/t) for 0 <t <1 then y: [0,1] > R? is a simple path from 
(0,0) to (1,0) in R? of infinite length. Hilbert’s path h has infinite length, 
since if n € N then the approximation h,, has length 2” + (/2 — 1)/2", and 
I(h) > (hn) for all n. 

A path of finite length is called a rectifiable path. 


Proposition 16.6.1 Suppose that y: [a,b] + (X,d) is a path. 
(1) lian (y) = d(a, d). 
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(i) Vaoy) = Uae (7)- 
(iit) Ifa <c< bb then the restrictions of 7 to [a,c] and to |c,b] are both 


paths, and Va,c} (y) + le,b] (7) = lia,] eae 
(iv) If 8: [c,e] + (X,d) is a path similar to y then lee(B) = la (7)- 


Proof All these results follow immediately from the definitions. 


As a cautionary example, consider the norm ||(x1, x2)||,, = max(|x1|, |v2]) 
on R?. Let a = (0,0), b = (1,0) and c = (2,1), and let y be the path 
a(a,b) V o(b,c). Then Ig g(y) = 2 = ||le—all, although y is not a linear 
path. This phenomenon cannot occur in Euclidean spaces, or indeed in an 
inner product space. 


Proposition 16.6.2 Suppose that 7: |a,b] > V is a simple path in a real 
inner product space (V,(.,.)) for which lau (7) = |ly(b) — y(a)||. Then y is 
similar to the linear path a(y(a), y(b)). 


“A straight line is the shortest distance between two points” (Thomas 
Carlyle). 


Proof If the result is not true then there exists a < c < b for which 
c—aand b—care not linearly dependent. Then, using the Cauchy—Schwarz 
inequality, 


llr) — Wal? = IO — va) + 8) -— YO)? 


= Ily(o) — (aI? + lle® — VOW? +2 OO) — 1), 78) — 10) 
< IIy(e) — YI? + Ir) — VOI? + 21 — YI ly) — (0) 
= (Ily(e) — v(a)Il + llr) — VON)? S Ca, ))?, 


giving a contradiction. 


It is convenient to use path length to parametrize a rectifiable path. 


Proposition 16.6.3 Suppose that y : [a,b] + (X,d) is a rectifiable path. 
Let \(a) = 0, and let I(t) = liay(y), for t € (a,b]. Then | is a continuous 
increasing function on [a, }]. 


Proof Ifa<s<t< 6 then I(t) = I(s) +leu(y) = Us), so that J is an 
increasing function on {a, }]. 

We shall show that if a < t < b then / is continuous on the right at ¢; 
the proof of left continuity is exactly similar. Suppose that « > 0. Recall 
that I(t*) = inf{l(s) : s > t}. There exists 6 > 0 with t+ 6 < b such 
that d(y(s), y(t)) < €/3 and I(s) < I(t*) + €/3 fort < s <t+0. Thus if 
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t<s<r<t+6 then lj, 1(y) =U(r)—I(s) < €/3. Suppose that t < s < t+. 
There exist t = to < t) <...tpn = 8 such that 


S > d(y(t;-1), V(ts)) > Ua, — €/3 = Us) — Ut) - €/3. 
j=l 
Then 
I(s) —U(t) < d(y(to), v(t1)) + D_ aly (tj-1), Vey) + €/3 


j=2 
< d(y(to); V(t1)) + less (7) + €/3 
< €/3 + (I(s) — U(t1)) + €/3 <e. 


Proposition 16.6.4 Suppose that y : [a,b] > (X,d) is a rectifiable path. 
Then there exists an equivalent path B : (0, ljaaj(7)] + [y] such that lo,.)(8) = 
8, for 0 < 8 < lay (9): 


Proof Let l(a) = 0 and let I(t) = ljay(y), for t € (a,b). Then / is a 
continuous increasing function on [a,b], and I([a, b]) = [0,1(b)]. Suppose that 
0 <t < 1(b). Let I, = I-'({t}). Then J is a (possibly degenerate) closed 
interval containing t. Suppose that J; is not degenerate, and that r,s € |; 
with r < s. Then I(s) = I(r) +1, (7) = Ur), so that Ip,.j(7) = 0. Since 
d(y(r), V(8)) < lir,s)(7), it follows that y(r) = 7(s). Thus if we set B(t) = 7(s) 
for some s € I;, then (@ is properly defined, and y = 6 ol. Since y and / are 
continuous, it follows from Proposition 15.1.6 that 6 is continuous. Clearly 
[3] = [y], and tp4(8) = t, for ¢ € [0,()] 


This path is called the path-length parametrization of y. Its use simplifies 
many problems involving paths. Let a(t) = 6(t/l(y)) for 0 < t < 1; the path 
a: [0,1] > X is the normalized path-length parametrization. 

Note that if 7 is a simple rectifiable path, then the function I(t) = lja.4(7) 
is strictly increasing on (a, b], and the mapping / is a homeomorphism of [a, }] 
onto [0,1(b)]. In this case, the proof is much easier; simply set 8 = yol7!. 


Exercise 


16.6.1 Show that the function p(z,y) = /|x — y| on [0,1] is a metric on 
(0, 1] which is uniformly equivalent to the usual metric. Let y(t) = t 
for t € [0,1]. Is 7 a rectifiable path in ((0, 1], p)? 


Part Four 


Functions of a vector variable 


17 


Differentiating functions of a vector variable 


17.1 Differentiating functions of a vector variable 


In Part Two, we considered continuity and limiting properties of real- 
valued functions of a real variable — functions defined on a subset of R. 
In Part Three we extended these ideas to functions between metric spaces, 
or between topological spaces. In particular, these results apply to functions 
of several real variables — functions defined on a subset of R2. 

We now turn to differentiation. This involves linearity: we therefore con- 
sider functions defined on a subset U of a real normed space (£, ||.||,,) taking 
values in a real normed space (F;||.||-). In fact, our principal concern will 
be with functions of several real variables (functions defined on an open 
subset of R2), but it is worth proceeding in a more general way. First, this 
illustrates more clearly the basic ideas that lie behind the theory. Secondly, 
even in the case where we consider functions defined on a finite-dimensional 
Euclidean space, there are advantages in proceeding in a coordinate free 
way; not only is the notation simpler, but also the results are seen to be 
independent of any particular choice of coordinates. 

Recall (Volume I, Section 7.1) that a real-valued function f defined on an 
open interval J is differentiable at a point a of J if and only if there exists a 
real number f’(a) such that if 


r(h) = f(ath)— fla) — fi(ah 
for all non-zero h in J—a = {x € R: 2+a € J}, then r(h)/|h| > 0 as 


h + 0; that is, r(h) = o(|h|). Let us set Df, (a) = f'(a)ax, for « € R. Then 
Df, is a linear mapping from R into R and 


flat h) = fla) + Dfalh) + r(h) 
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for allheI—-a={xeER:x2+a€ I}. Thus f is differentiable at a if and 
only if we can write f as the sum of a constant (the value at a), a linear term 
Df.(h), and a small order term r(h). From this point of view, differentiation 
is a matter of linear approximation. 

These ideas extend naturally to vector-valued functions of a vector vari- 
able. Suppose that f is a function defined on an open subset U of a real 
normed space (F, ||.||,;), taking values in a real normed space (F, |].||-), and 
that a € U. We say that f is differentiable at a, with derivative D fa, if there 
is a continuous linear operator Df, € L(E,F) such that if 


r(h) = fla+h)— fla) — Dfalh) 


for all non-zeroh € U-—a= {xe BE: 4+a€ Uy}, then r(h)/||h|| > 0 as 
h — 0. Again, we express f as the the sum of a constant (the value at a), a 
linear term Df,(h), and a small order term r(h). 

Note that we require Df, to be a continuous linear mapping; this con- 
dition is automatically satisfied if E is finite-dimensional, since any linear 
operator from a finite-dimensional normed space into a normed space is con- 
tinuous (Corollary 15.9.3). Note also that the conditions remain the same 
if we replace the norm on FE and the norm on F' by equivalent norms. In 
particular, when F is finite-dimensional then we can use any norm on FE 
(and similarly for F’), since any two norms on a finite-dimensional space are 
equivalent (Theorem 15.9.2). 

Let us consider three special cases. First, when EF = R, we set f’(a) = 
Df.a(1), so that 


f(at+h) = f(a) +hf'(a)+r(h) forallhe U-a={re E:x2+a€U}; 


f’(a) is an element of F’, while Df, is a linear mapping from R into F’. Note 
that if ||Dfq|| is the operator norm of Df,, then 


||P fall = sup{||Dfa(A)Ilp = IAl S 1} = |[F'(@)||p- 


Secondly, suppose that H is a real Hilbert space, and that F = R. In this 
case, Df, is a continuous linear functional on H. By the Fréchet-Riesz rep- 
resentation theorem (Theorem 14.3.7), linear functionals can be expressed 
in terms of the inner product; there exists an element Vf, of H such that 
Dfa(h) = (V fa, h). The vector V fa is called the gradient of f at a. The sym- 
bol V was introduced by Hamilton, and named ‘nabla’ by Maxwell; ‘nabla’ is 
the Greek word for a Hebrew harp. Nowadays, it is usually more prosaically 
called ‘grad’. 
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Thirdly, suppose that f = (f1,...,fn) is a mapping from an 
open subset U of a normed space (£,]|].||~) into a finite product 
(Fi, I-Ilp,) x +++ X (Fn, |l-llp,) of normed spaces. Then f is differentiable 
at a point a of E if and only if f; is differentiable at a for each 1 < j < n; 
if so, then Dg = (COFq ax ssss (D fala): 

If f is differentiable at every point of U, we say that f is differentiable 
on U. If so, then a + Dfg is a mapping from U to the normed space 
(L(E, F’),||.||) of continuous linear mappings from FE into F’. We say that 
f is continuously differentiable at a if this mapping is continuous at a, and 
that f is continuously differentiable on U if it is continuously differentiable 
at each point of U. 

As a first example, if T € L(E, F) then T(a +h) =T(a)+T(h) for all a 
and h in E, so that T is differentiable at every point of EF, and DT, = T. 

We now have the following elementary results. 


Proposition 17.1.1 Suppose that f and g are functions defined on an 
open subset U of a normed space (E, ||.||~), taking values in a normed space 
(F, ||.||_-), that a € U, and that f and g are differentiable at a. 

(i) Dfa is uniquely determined. 

(ii) If €e > 0, there exists 6 > 0 such that Ns(a) C U and such that 
IF(a-+h) — F(a) < (Dfall +6) llbll for [lly <6. 

(iii) f is continuous at a. 

(iv) If, € R then A\f +g is differentiable at a, with derivative XD fa+ 
UD Ga. 


Proof (i) Suppose that T,,7) € L(E,F) and that 
f(at+h) = f(a) + Ti(h) + si(h) = f(@) + Ta(h) + 52(h) for he U —a, 


where s1(h)/ ||hl|,, > 0 and s9(h)/ ||h||,, + 0 as h > 0. Suppose that x is a 
non-zero element of EF. Let y = T)(x) — To(x). Since 


_ Ty (Ax) — To(Az) _ _ 81(Ax) — 82(Az) ah 
Ilr \l_z Arle Arle 


as A > 0, y = 0. Since this holds for all non-zero x in E, T; = 7». 
(ii) Let f(x +h) = f(x) + Dfa(h) + r(h). There exists 6 > 0 such that 
Ns(a) C U and such that ||r(h)||e < €|lAl|g for ||Alle < 6. Then 


[F(a +h) — fla)lle = |Dfalh) +r) < Dfolr)lle + Ilr@)lle 
< ||Dfallllhlle + €llh|lz = (|Dfall + €)[lAlla. 
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(iii) Suppose that « > 0. Let 6 satisfy the conclusions of (ii), and let 
n = 6€/(6 + 1)(||D fal] + €). If ||All eg < 7 then |All, < 6, so that 


[f(a +h) — flallle < (lPfall + ©) llPlle <¢. 


(iv) As easy as in the real scalar case. 


Theorem 17.1.2 (The chain rule) Suppose that U is an open subset of a 
normed space (E,||.||,,), that f is a function defined on U, taking values in a 
normed space (F’,||.||,-), and that f is differentiable at a point a of U. Suppose 
that V is an open set of F containing f(U) and that k is a function defined 
on V, taking values in a normed space (G,||.||q), and differentiable at f(a). 
Then the function ko f is differentiable at a, with derivative Dk fq) 0 Dfa. 


Proof Let us set b= f(a), and suppose that 
f(a+h) = f(a) + Dfalh) + r(h). 


First we simplify the problem, by showing that we can replace k by a function 
j for which Djpy = 0. Let j(y) = k(y) — Dko(y), for y € V. By Proposition 
17.1.1, 7 is differentiable at b, and Djp(y) = Dky(y) — Dko(y) = 0. 

Since Dky is linear, 


Dko(f(a + h)) = Dko(f(a)) + Dko(Dfa(h)) + Dko(r(h)). 


But ||Dky(r(h) )Ilg < [Dhol Ilr(@)llp, so that Dky(r(h))/|[hll_ > 0 as h > 
0. Thus Dky o f is differentiable at a, with derivative Dk, o Df,. Since 
(ko f)(x) = (jo f)(a) + (Dky o f)(x), it is therefore sufficient to show that 
jo f is differentiable at a, with derivative 0. In other words, we must show 
that |j(fla+h)) -—iF@M)lle/llhlle > 0 as h — 0. 

Suppose that « > 0. Let L = ||Dfj.|| + €. By Proposition 17.1.1 (ii), 
there exist 6 > 0 such that Ns(a) C U and ||f(a+h) — f(a)||p < LIlAlle 
for ||h||_~7 < 6. Since j is differentiable at b, there exists 7 > 0 such that 
N,(b) G V and 


FO +0) —JO)lla < elldlle/L, for [lle <n. 


If ||Allz < min(d,7/L) then ||f(a+h)— f(a)|lp <n. Set l= f(a+h)— f(a); 
then 6+1= f(a+h), so that 


9(f(a + h)) — j(F(a))lla < ell fla +h) — fl@)|le/L < €llhlla. 


Since this holds for all € > 0, the result follows. 
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Corollary 17.1.3 Jf f is continuously differentiable on U and k is con- 
tinuously differentiable on V then ko f is continuously differentiable on U. 


Proof For the functions x + Dky(z) and x + Df, are continuous, and 
the composition of two continuous functions is continuous. 


Let us give some examples. 
Example 17.1.4 The derivative of a bilinear mapping. 


Suppose that (F,||.||~7), (F.|l-||7) and (G, ||.||~) are normed spaces, and 
that ||.|| is a product norm on EF x F’. Suppose that B is a continuous bilinear 
mapping from the product E x F into G. Then 


B((2,y) + (h,k)) = B(x, y) + B(h,y) + B(a,k) + B(h,k), 


and ||B(h, k)llg < |Blhoollhlle lille S |Bllo-I(@,A)I?, where ||Bl,. = 
sup{||B(z,y)|le =: llzlle < 1, llylle < 1} (see Exercises 14.3.1 and 14.3.2), 
so that ||B(h,k)||@/||(h, &)|| - 0 as (h,k) — 0. Thus B is differentiable at 


each point of B x F, and DB) (h, k) = Bh, y) + B(z,k). 
Example 17.1.5 The derivative of the norm of a real inner-product space. 


Suppose that E is a real inner-product space. Let N(x) = ||a||. Can we 
differentiate N on E? If x € E \ {0} let 1,(A) = Aw. Then |{l,(A)|| = [Al |x], 
so that the mapping N ol, is not differentiable at 0. Consequently N is not 
differentiable at 0. 

Suppose on the other hand that « #4 0. We write N as a product of 
mappings, consider each factor separately, and use the chain rule. We write 
N(x) = (SoPoJ)(x), where J: E\ {0} > Ex E is defined as J(x) = (2,2), 
P: Ex E> Ris the inner product map P(z,y) = (x,y) and S: (0,00) > 
(0, co) is the square root map S(x) = /z. Then S(P(J(x))) = ||z||, and 


DJ, = J, since J is linear, 
DP yy(h,k) = (hy y) + (x, k), by the previous example, 
and DSi) = hf 26/2: 
By the chain rule, N is differentiable at x and 
DN,(h) = DSp(j(e)) DP yx) DIx(h) = DS p(s(x)) DP (a) (h; h) 


= DS pape (2(e,h)) = SP. 


Thus N is differentiable at each point of E \ {0}. 
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Example 17.1.6 The derivative of the mapping J : U — U7! from 
GL(E) to itself. 


Suppose that U is in the general linear group GL(E) of a Banach space 
E. Since GL(E) is an open subset of L(E), there exists 6 > 0 such that 
Ns(U) © GL(E). Suppose that 0 < ||T'|| < 6. Since T = (U+T) —JU, it 
follows that (U + T)-'TU-! = U-! —- (U + T)“}, and so 


(U+T)1=U1-U TU 1 +r(T), 
where r(T) = (U-1 — (U+T)-1)TU~—. Thus 
Irs Jat - +7). or]. 


Since (U + T)-! > U-! as T 5 O, it follows that r(T)/||T|| + 0 as T > 0. 
Thus J is differentiable at U, and DJy(T) = —U~'TU~!. Consequently, J 
is continuously differentiable on GL(E). 


Exercises 


17.1.1 Find the points of R¢ at which the norms ||z||, = = |x;| and 
|z||,, = max{|x,;|: 1 <j < d} are differentiable, and determine the 
derivatives at these points. 

17.1.2 Suppose that EF is a real inner product space. Let p(x) = x/ ||a||, for 
x € E \ {0}. Show that p is differentiable and that 


oh (Ei) % 
lel] jal? 


Dpx(h) 


Verify that (Dp.(h), 7) = 0. Explain the geometric reason for this. 

17.1.3 Suppose that (£, ||.||,,) is a normed space. If T € L(F), let s(T) = 
T?. Show that s is a differentiable mapping L(F') > L(F), and that 
Ds7(S) = ST +TS. 

Show that ifn € N then the mapping p™ : L(E) + L(E) defined by 
p\")(T) =T" is differentiable, and determine its derivative. 

17.1.4 Let Ma(R) be the vector space of d x d real matrices. Show that the 
mapping T > det T : Mq(R) > R is differentiable. Show that if J is 
the identity matrix then D det;(S) = tr(S) (where tr(S) = am By 
is the trace of S). Show that if T is invertible then Ddetr(S) = 
det T(tr(T!S)). 

17.1.5 Suppose that f:U—-F is a mapping from an open subset U of 
a normed space (£,||.||,~) into a normed space (F,||.||~) which is 
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differentiable at a point a of U, and suppose that there exists 
TEL(F,E) such that s = TDf,(x), for all c € EF. Show that there 
exists 6 > 0 such that N5(a) C U, and such that if x € N5(a) then 
fo) #40). 

Does there necessarily exist 6 > 0 such that N5(a) C U, and such 
that if x,y € Ns(a) then f(x) F f(y)? 


17.2 The mean-value inequality 


The mean-value theorem is a powerful result for real-valued functions on 
a closed interval in R. We cannot hope for an equivalent result for vector- 
valued functions. For example, let f : [0,27] — R? be defined by setting 
f(t) = (cost, sint) for t € [0,27]. Then f(0) = f(27) = (1,0), while f’(t) = 
(—sint, cost) 4 (0,0) for any t, so that there exists no ¢ in [0,27] for which 
f (2m) — f(0) = 27 f'(t). We can however prove an inequality, known as the 
mean-value inequality, which is extremely useful. First we consider functions 
in a closed interval. 


Theorem 17.2.1 Suppose that f : I > F is a path from a closed interval 
[a, b] in R into a normed space (F,_||.||,~) which is differentiable at each point 
of (a,b). Then 


IF() — Fla)Ilp < (6 — a) sup{||f’(o)|| p : ¢ € [a, Bf. 


Proof Let M = sup{||f’(c)||- : ¢ € [a,6]}. If M = oo there is nothing to 
prove. Otherwise, suppose that a < a’ < b! < b and that € > 0. We shall 
show that || f(b!) — f(a’)||p < (b — a’)(M 4+ €). Then since e is arbitrary, 
If (0) — f(a’) ||» < (b—a@)M. Since f is continuous at a and 8, and since the 
mapping x — ||z|| is continuous, it follows that || f(b) — f(a)||p < (b—a)M. 

Let B = {t € [a,b] : ||fO-fle@)llp < (¢-a@’)(M + ©}. Since the 
function t > || f(t) — f(a’)||» — (¢-—@’)(M +.) is continuous on [a’, b’], B is 
a non-empty closed subset of [a’, b’]. Let c= sup B. If c < 0’, it follows from 
Proposition 17.1.1 (ii) that there exists c < d < b! such that 


If@ -fOlly < 4-H Ollp+ ©) < @-o(M +). 


But then 


IF@) -f@ Ip SFO - FOlle + FO - f@) lp 
< (d—c)(M +e) + (c—a’)(M +¢) = (d—a')(M +e). 
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Thus d € B, giving a contradiction. Thus c = b’. But B is closed, and so 
b' € B; thus || f(b) — fl@ lp < 0 — a’)(M + €). 


We now extend this result to functions of a vector variable. 


Theorem 17.2.2 (The mean-value inequality) Suppose that f :U > F is 
a continuous function from an open subset U of a normed space (E,||.||~7) 
into a normed space (F,||.||~). Suppose that the closed segment [a, b] is con- 
tained in U, and that f is differentiable at each point of the open segment 
(a,b). Then 


F() — F@lle < 6 allgsup{||Dfell : ¢ € (a, b)}. 


Proof This follows from the chain rule. Let /(t) = (1—t)a+tb. Then fol is 
continuous on [0, 1] and differentiable on (0,1), and (fol)’(t) = Dfye)(b—a), 
by the chain rule. Thus 


F(6) — f@)llp < sup{||(f 0) pt € 0, 1)} 


= sup{||Dfc(b — a)llp + ¢ € (a,b)} 
< ||b — al], sup{||D fell : ¢ € (a, 6) f- 


Corollary 17.2.3 Suppose that f : U > F is a continuous function from 
a non-empty open connected subset U of a normed space (E,_||.||,~) into a 
normed space (F’,||.||;-), that f is differentiable at each point of U and that 
Df, =0 for alla ce U. Then f is a constant. 


Proof Let 2 be an-_ arbitrary element of U, and __ let 
C= {x €U: f(x) = f(xo)}. On the one hand, C is a closed subset of 
U, since f is continuous. On the other hand, if c € C, there exists 6 > 0 
such that N5(c) C U. If d € N5(c) the closed segment [c, d] is contained in U, 
and so ||f(d) — f(c)|| = 0, by the theorem. Thus d € C, and so N5(c) C C. 
Hence C is open; since U is connected, C = U. 


Corollary 17.2.4 Suppose thatG:U > L(E, F) is a continuous function 
from a non-empty connected open subset U of a normed space (E, ||.||,,) into 
L(E,F), where F is a normed space (F,||.||-). If fi and fo are any two 
solutions of the partial differential equation Df, = G(x), for x € U, then 
fi — fo is a constant function. 


Proof For D(fi — fz) = 0. 
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Corollary 17.2.5 Suppose that f : U > F is a continuous function from a 
non-empty convex open subset U of a normed space (E, ||.||,,) into a normed 
space (F,_||.||,-), that f is differentiable at each point of U and that ||D fal] < 
M for alla €U. Then f is a Lipschitz function on U, with constant M. 


Proof For if a,b € U then [a,b] C U, so that || f(b) — f(a)|| < M |lb-—a 


The following form of the mean-value inequality is also useful. 


Corollary 17.2.6 Suppose that f:U—>F is a continuous function from 
an open subset U of a normed space (E, ||.||,;) into a normed space (F, |\.|| -)- 
Suppose that the closed segment [a,b] is contained in U, and that f is 
differentiable at each point of the open segment (a,b). If T € L(E, F’) then 


f(b) — F(a) — T(b— a)|| < [6 — a|| sup{|| Df. — TI] = ¢ € (a, b)}- 


Proof Let g(x) = f(x) — T(x). Then g is differentiable, with derivative 
Df, —T, for x € (a,b). Apply the mean-value value inequality to g. 


The mean-value inequality allows us to obtain a more general version of 
Theorem 12.1.8. 


Theorem 17.2.7 Suppose that (f,)°, is a sequence of differentiable real- 
valued functions on a bounded open convex subset U of a normed space 
(E,||.||~), taking values in a Banach space (F,_||.||,-). Suppose that 

(i) there exists c € U such that f,(c) converges, to f(c) say, as n > ov, 
and 

(ii) the sequence (Dfn)°1 of derivatives converges in the operator norm 
uniformly on U to a function g from U to L(E,F). 

Then there exists a function f :U > F such that fy, > f uniformly on 
U. Further, f is differentiable on U, and Df(x) = g(x) for alla €U. 


Proof This follows by making straightforward changes to the proof of 
Theorem 12.1.8; the details are left as a worthwhile exercise for the 


reader. 


Important examples of rectifiable paths are given by piecewise continu- 
ously differentiable paths. Suppose that (£,||.||) is a Banach space. A path 
7: [a,b] > E is continuously differentiable if it is differentiable on [a, b] (with 
one-sided derivatives at a and b), with derivative 7 continuous on [a,b]. A 
piecewise continuously differentiable path, is a juxtaposition of finitely many 
continuously differentiable paths. 
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Theorem 17.2.8 Jf (E,||.||) is a@ Banach space and y is a piecewise 
continuously differentiable path in E then y is rectifiable, and lia aj(y) = 


So ll’ @IL at. 


Proof It is clearly sufficient to consider the case where y is continuously 
differentiable. Then 7’ is bounded on [a,b]; let M = sup;eja,gj ||7'(4)||. Sup- 
pose that « > 0. Let 7 = €/4(b — a). Since 7’ is uniformly continuous on 
[a,b], there exists 6 > 0 such that if s,t € [a,b] and |s —t| < 6 then 


IIlY'(s) — Ol < n- 
Suppose that 
a=to<t)<---<t, =), with tj —tj_1 <6 forl<j<n. 
By Corollary 17.2.6 
Il-v(ts) — y(tj—1) — (ty — ty-1)9/(ty-1)|] S ny — t-1), 


for 1 <j <n. Then ||y(¢;) — y(tj-1)|| < (M+ )(t; — t;-1), so that 
Eiko V(tj-1) || < (M + 7)(6— a), 


and ¥ is rectifiable. 
Now \|9’(t) — 7 (t;-1)| <1 fort € liga tal so that 


[ ||’ @)|] dé — ( — ty-1) ||7’-1) ||| S ty - 4-2), 


g-1 


and so 


g-1 


in Il’ @)|| at — lars) — yy—a) II] S 2n(ey - t)-2), 


Adding, we see that 


b n 
/ I1v’(®)|| at — SD vty) — (ja) II] < 2n(b — a) = €/2 
a 4 


b 
[oll tenn] <e 


Since € is arbitrary, the result follows. 


and so 
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Corollary 17.2.9 If I(t) = lja4(7) then | is piecewise differentiable on 
[a,b] and U'(t) = |Iy’(e)II- 


Corollary 17.2.10 Jf 6 is the path-length parametrization of y then 2 
is a piecewise continuously differentiable path, and ||6'(t)|| = 1 (suitably 
interpreted at points of juxtaposition). 


Proof For B=yolt. 


Corollary 17.2.11 Jf 6: [0,d|) — E is a continuously differentiable 
parametrization of y and ||6’(t)|| = 1 for t € [0,d], then 6 is the path-length 
parametrization of ¥. 


Proof For I(t) = fy |\"(s)|| ds =¢. 


As an example, the circular path x«,(w) in R? is differentiable, 
and (K,(w))/(t) = (-rsint,rcost), so that ||(«,-(w))’(é)|| = r, and 
lio,an](Kr(w)) = 2. 

Recall that a path is piecewise-linear, or polygonal, if it is the juxtaposition 
of finitely many linear paths. We can approximate a rectifiable path in a 
Banach space by a piecewise-linear path, without increasing path-length. 


Proposition 17.2.12 Suppose that 7 : [a,b] > U is a rectifiable path in 
an open subset U of a Banach space (E,|\.||), and that « > 0. Then there 
exists a piecewise-linear path 6 : [a,b] + U such that ||6(t) — y(t)|| < € for 
te a, bl, and lia,b) (9) < Via,b] (7). 

Proof Since [y] is compact and since y is uniformly continuous on [{a, 5] 
there exists 7 > 0 such that if s,¢ € [a,b] and |s — t| < 7 then N.(7(t)) CU 


and ||y(s) — y(t)|| < €/2. 
Let a= to <t) <-+: <tp =b be a dissection of [a,b] with t; — t;-1 < 
forl<j<k. 


0 ag Se he a = Sa) 
Sines 60) = 404) Gt) aa), 80) e Uand 
I(t) — y(é)I] S []6(4) — yj—aI + lly) — y(tj-1) Il < & 
Also, Ija,9(6) = S7*_4 Ilys) — (tsa < Ya ()- 


Exercises 


17.2.1 We have used a connectedness argument to prove the mean-value 
inequality. It can also be proved using a compactness argument. With 
the notation introduced in Theorem 17.2.1, show that there exist 
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a’ =tp < ti <--- <t, = 0! such that 
f(t) — f(j—Dlle < ( —tj-1)(M + 6) for 1 <j <k, 


and deduce the mean-value inequality. 

Give the details of the proof of Theorem 17.2.7. 

[The Newton—Raphson method] Suppose that f is a differentiable 

mapping from an open neighbourhood N;(2o) of a point xo of a 

Banach space (£, ||.||,;) into a normed space (F,||.||-), and that there 

exists s > 0 such that 

« [f(2o) lle < t/253 

e if z,y € N:(xo) then ||Dfz — Dfy|| < 1/2s; and 

e if c € N;(xo) then there exists J, € L(F, FE) with ||Jz|| < s such 
that Jz o Df, = Df, °o Jz = I, where I is the identity mapping 
on EF. 

Define (x,) by setting t, = @n_-1 — Jz(f(Xn-1)), for n € N. Use 

Corollary 17.2.6 to show that 


[fn — ails < t/2” and Wt (Gea) ls < t/artts, 


Show that (x) converges to a point to of N(x), that f(r.) = 0 
and that x is the only point in N;(ao) with this property. 
Suppose that 7 is a rectifiable path in R?. Show that, given € > 0 
there is a finite set of closed rectangles whose union contains the track 
[y], and has area less than e. Deduce that the interior of [7] is empty. 
Deduce that a space-filling path in R? is not rectifiable. 
Suppose that f is a function defined on a connected open subset U of 
a normed space (£, ||.||,;) taking values in a normed space (F’ ||.||-), 
and that ||f(z) — f(y)|lp < K |lc — yllG, for x,y € U, where K > 0 
and @ > 1. Show that f is constant. 


17.3 Partial and directional derivatives 


The derivative of a vector-valued function of a vector variable is a linear 


operator. It is desirable, where possible, to express it in simpler terms. Sup- 
pose first that (F,||-||~) = []/1(£j, ||-||;) is the product of normed spaces 
(Ej, ||.||;), and that f : U > F is a function from an open subset U of EF into 
a normed space (F*,||.||-). We can vary each variable separately. If h; € Ej, 
let 1;(hj) = hy = (0,...,0,;,0,...,0), where h; occurs in the jth place. If 
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a€U let 
Regie) =a+ te hty) = (a1, ee @j—1, 05 + ig 5 ig 15 bans inl. 


Then kaj is differentiable at every point of E;, and Dk,j;=1;. Also 
(kaj) 1(U) is an open subset of Ej, containing 0. If the mapping f o kaj : 
(kaj) 1(U) > F is differentiable at 0, we denote its derivative by D; fa; this 
is the jth partial derivative of f at a, and is an element of L(Ej, F). 

If E = R42, we set 


Di fal) = (Of /02))(a) = SE (a) 

J 

Then af af 
Fela) € P and Dyfed) = 54 (a). 


Suppose that f is differentiable at a. Then 


f (kag (hy)) = f(a + hy) = f(a) + Dfa(hy) + r(hy) 
= f (kaj(0)) + Dfa(h;) + r(hj), 


so that the jth partial derivative Dj fy exists, for 1 < j < d, and Dj fa(h;) = 
Dfalh 4), for hy € £;. Further, if h = (h; ¢ ‘_, then 


d 
Dfa(h “> Dfa(hj) = >) Djfa(hy) 
j=l 


In particular, if E = R¢ and F = R*, and f = (fi,..., fe) then 


filath) = w+ hae ey a) + r;(h). 


where (0f;/Ox;)(a) € R: the derivative Df, € L(R%,R*) is represented by 
the k x d real matrix (Of;/0x;(a)). 
When E = Ré and F = R, then 


Via= (FE(a)... Ph (a). 


In the special case where d = k, we shall need to know when the lin- 
ear operator Df, : R¢ > R? is invertible. This is the case if and only if 
the determinant of the matrix (0 f;/0x; Cake is non-zero (Appendix B, 
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Corollary B.3.2). This determinant is called the Jacobian of f at a, and is 
denoted by 


' Ol Figexsy fa) 


gO) Gas cco 


We can also consider directional derivatives. Suppose that f is a mapping 
from an open subset U of a normed space (E,||.||,;) into a normed space 
(F, ||.||~). Suppose that y € EF and that y ¢ 0. There exists € > 0 such that 
the interval [a,a + ey) is contained in U. We say that f has a directional 
derivative in the direction y if there exists an element f(a) of F' such that if 


ry(A) = f(a+ Ay) — f(a) —Afy(a), for 0<A<e, 


then ry(A)/A + 0 as A’ + 0. The vector fj (a) is then the directional 
derivative in the direction y. 

If f is differentiable at a, then it has directional derivatives in all direc- 
tions, and f, (a) = Df.(y), for y 4 0. In the appropriate case, it also has 
partial derivatives. The converse statements are not true, as the following 
simple example shows. Let 


Fe.9) = app for (#9) # (0,0) and let f(0,0) = 0. 


Then the reader should verify the following statements: 


e f is a continuous real-valued function on R?; 

e f is differentiable at every (a,b) 4 (0,0), and the mapping (a,b) > D fia») 
is continuous and bounded on R?\ {(0,0)} (use the chain rule, rather than 
elaborate calculations); 

e f has partial derivatives at (0,0) and 


Of _ Of = 


e f has directional derivatives in all directions at (0,0), all equal to 0. 


On the other hand, f is not differentiable at 0. If it were, the derivative 
would have to be 0, and so f(x, y)/||(x,y)|| would tend to 0 as (x,y) > 0. 
But f (t,t?) = t/2 for all t € R \ {0}, and so this is not the case. 

This is inconvenient, to say the least. For example, if we are investigating 
the differentiability of a function defined on an open subset of R%, the first 
step will be to find out whether or not partial derivatives exist. If they do, 
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can we use them to tell whether the function is differentiable or not? The 
next theorem gives an extremely useful test. 


Theorem 17.3.1 Suppose that (EF,||.||~)=£1 x Ey is the product of 
normed spaces (£4,||.||,) and (Eo,||.\|,) and that f:U—>F is a function 
from an open subset U of E into a normed space (F,_||.||-). Suppose that 
Di fa exists at a and that Daf, exists for all x in a neighbourhood No(a) of 
a, and is continuous ata. Then f is differentiable at a. 


Proof TIfa+heJU, let 
r(hi, he) = f(ath) — fla) — Difa(hi) — Dofalh2). 


We must show that r(h)/||h||,, 20 as h-0. Now Diro = 0, Dorp = 0 and 
Dor is continuous at 0. Suppose that «€ > 0. There exists 0 < 6 < 6 such 
that 


IIr(Aa, Olle < €l|Allg /2 and ||Darall = ||D2farn — Defall < €/2 for |[h|lz < 4. 
By the mean-value inequality, if ||h||,, < 6 then 


l|r(Aa, ha) — r(h1, 0) Il» S [lhalla sup{||Dor(a,,rn2)|] 20 < A < 1} 
<€llAlle /2, 


and so 


IIr(h) lle S Ir, OMe + [Ira ha) — 7A, O)llp < €[lAlle- 


Corollary 17.3.2 Suppose that (E,||-\|e) = Hj-1(Ej, |l-ll;) ts @ product 
of normed spaces (E;,||.||;) and that f:U — F is a function from an open 
subset U of E into a normed space (F,_||.||~). Suppose that all the partial 
derivatives (Of /Ox;)(b) exist, for all b in a neighbourhood of a, and are 
continuous at a. Then f is differentiable at a. 


Proof A simple inductive argument. 


Exercises 


17.3.1 Let g be a real-valued function on the unit sphere S¢!. Let k(0) = 0 
and let k(x) = ||2||? .g(a/ |||), for 2 4 0. Show that k has directional 
derivatives in all directions at 0. Give examples to show that k need 
not be continuous, and to show that k can be continuous, but not 
differentiable at 0. 
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17.3.2 Suppose that (£,||.||,;) is a normed space and that f : E x E > 
(F, ||-||-) is a differentiable mapping. If c € E, let g.(%) = f(x,c—2). 
Show that g.: E — F is differentiable, and determine its derivative. 
Suppose that D,f = —Do2f. Show that there is a differentiable 
function k: EF > F such that f(x,y) = k(x — y). 


17.4 The inverse mapping theorem 


We have seen in Volume I, Propositions 6.4.4 and 6.4.5 that a continuous 
function f on an open interval (a,b) of R is injective and has a continuous 
inverse if and only if it is strictly monotonic. A sufficient condition for this 
is that f is differentiable on (a,b), and that f’(x) #4 0 for all x € (a,b) 
(Volume I, Corollary 7.3.3). Thus if f’(a) 4 0 and f’ is continuous at a then 
there is a neighbourhood N,(a) such that f is a homeomorphism of N,(a) 
onto f(N-(a)). (The condition that f’ is continuous cannot be dropped: see 
Exercise 7.5.9. in Volume I.) 

We now prove a corresponding result for vector valued functions. If 
f : W — F isa mapping from an open subset W of a Banach space (£, |].||;-) 
into a Banach space (F;,||.||,-) which is differentiable at a point a of W, we 
say that Df, is invertible if Df, is a bijection of EF onto F’. By the isomor- 
phism theorem (Corollary 14.7.9), Df; + is a continuous linear mapping, so 
that Df, is an isomorphism of (F,||.||,;) onto (F,]].||-). (Df; 1 is trivially 
continuous when EF and F are finite-dimensional.) 


Theorem 17.4.1 (The differentiable inverse mapping theorem) Suppose 
that f:W-F is a differentiable mapping from an open subset W of a 
Banach space (E,|\.||;) into a Banach space (F,||.||-), and that a © W. 
Suppose that the derivative Df, is continuous at a and that Df, is invert- 
ible. Then there is a neighbourhood No(a) such that f(Ng(a)) is open in F, 
f : Ne(a) > f(No(a)) is a homeomorphism, and the inverse mapping f—' is 
differentiable at f(a), with derivative (Dfa)7'. 


Proof The proof uses the Lipschitz inverse function theorem (Theorem 
14.6.6). The first step is to simplify the problem. Let V = W — a, and let 
g(x) = f(a+a)—f (a). Then g(0) = 0 and Dg, = Dfzr+a; the mapping g from 
V to E is differentiable, and the mapping Dg: V > L(E, F) is continuous at 
0. Now let k = Dfy!og=Dg)' og. Then 


e k(0) =0, 
ek: V — E is differentiable, 
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e the mapping x + Dk, is continuous at 0, and 
e Dkyo =I. 


We prove the result for the function k. Then since f(x) = g(a—a)+ f(a) = 
Dfa(k(x — a)) + f(a), the result follows for f. 

We denote the open ball {: ||x||,, <a} with radius a by Uy. Let j(x) = 
k(x) — x, so that Djp = 0. Since the mapping x + Dj, is continuous at 0, 
there exists a ball Ug such that ||Dj~|| < 1/2 for x € Ug. If x,y € Us then 
[x,y] C Us, so that 


M =sup{||Diaje+q)||20<¢< 1} < 1/2. 
Hence, by the mean-value inequality, 
9(@) —I Mle < Mle —ylle S lle — gg /25 


thus j is a Lipschitz mapping on No(a), with constant 1/2. It therefore 
follows from the Lipschitz inverse function theorem that k(Ug) is open and 
that k is a homeomorphism of Ug onto k(Ug). 

It remains to show that k~1: k(Ug) > Ug is differentiable at 0, with deriva- 
tive I. Suppose that 0<e<1. There exists 0 < 6 < 6 such that if h € Us; 
then ||j(h)|| 7 < € |All, /2. Since k~! : k(Ug) Us is continuous, and since 
k(Ug) is an open neighbourhood of 0, there exists 7 > 0 such that U,, C k(Us) 
and (U4) GS Ug: 

Suppose that ||y|] < 7. Let k~'(y)=y+s(y). We shall show that 
IIs(wlle < €llyllg, so that k~+ is differentiable at 0, with derivative I. First, 
lly + syle = ||A*)||p < 6, 80 that 


Gy + sy) lle < elly + s)lle /2- 


Next, 
y = k(k-*(y)) = ky + 8(y)) =y + 8(y) + iy + (y)), 
so that s(y) = —j(y + s(y)). Thus 


IsMlle = ley + sole < €lly+ sMlle/2 < €llylle /2 + €llsMlla /2; 


since € < 1, ||s(y)lla < €llylle- 


Note that if the conditions of the theorem are satisfied then Df, is a 
linear isomorphism of (£, ||.||,,) onto (F’||.||,-). In practice, the theorem is 
usually applied when (£, ||.|| 7) = (F;||-||_r)- 
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Corollary 17.4.2 Suppose that f is a continuously differentiable function 
on W and that Df, is invertible, for each x © W. If V is an open subset of 
W then f(V) is open in F. 


Proof ‘This follows immediately from the theorem. 


Corollary 17.4.3 Suppose that f is a continuously differentiable function 
on W and that Df, is invertible, for each x € W. If f is injective then f 
is a homeomorphism of W onto f(W), f—' is continuously differentiable on 
f(W) and Df rq) =(Df,)',forzew. 


Proof The mapping f~! is differentiable at each point of f(W): 
we must show that Df! is continuous on f(W). But the mapping 
y> Di is the composition of the mapping y > f—!(y) : f(W) > W, 
the mapping w > Df, : W — GL(E) and the mapping J: U > U7! : 
GL(E) > GL(E£), each of which is continuous. 


A mapping f which satisfies the conditions of this corollary is called a 
diffeomorphism of W onto f(W). 


17.5 The implicit function theorem 


We have an implicit function theorem for differentiable functions. 


Theorem 17.5.1 (The implicit function theorem) Suppose that 
(Fy, |I-l,), (22, ||-Il,) and (F, ||.\|_-) are normed spaces, that (Eo, ||.||,) and 
(F,||.\|~) are complete and that f is a differentiable mapping from an open 
subset U of Ey x E into F. Suppose that a = (a,,a2) € U, that the partial 
derivative Daf, : Ey > F is invertible, and that the mapping x > Df, is 
continuous ata. Then there is a neighbourhood N of a, in Ey and a unique 
mapping 6: N — E> such that 


1. @ is continuous; 

2. the cross-section N x {a2} is contained in U; 

3. (x, o(a)) EU forxze N; 

4. f(x, o(x)) = f(a, a2) forz Ee N; 

5. o is differentiable at a1, and Doe, = —(Dofa)~!Difa- 


Thus the theorem says that there is a neighbourhood of a, on which there 
is a unique solution to the equation f(x,y) = f(a1,a2), that the solution 
is continuous in the neighbourhood, and that the solution is differentiable 
at a1. 
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(a, p(a)) 


——}—__—. 


a 


Figure 17.5. The implicit function theorem. 


Proof — As with the inverse function theorem, we simplify the problem. Let 
V = U-—a, and let g(x1, v2) = f (a1 +a1,%2+a2)— f (a1, a2), for (a1, 22) € V. 
Let k = (Dof;') og. Then k maps V into Fy, and 


e k(0) = 0, 

ek: V — Ep is differentiable, 

e the mapping x — Dk, is continuous at 0, and 
e Dako = I. 


We set T = Diko = (Dof7') 0 Di fa, and set 
j(41, 22) = k(a1, 22) — 2, (41,22) = k(a1, 22) — T (x1) — 22. 


We give FE, x Ep the norm ||(%1,22)|| = ||21||, + ||zall,. Let K = 2||T|] +1. 
Since r is continuously differentiable at 0, and Dro = 0, there exists 6 > 0 
such that N5(0) C V and ||Dr,|| < 1/2 for « € N5(0). By Corollary 17.2.5, 
I(x) — r(y)lly < Ella — yl) for x,y € No(0). Let 1» = 6/K. 

Let X, = {#1 € £1: ||x1||, < n} and let Xo = {x2 € Ey: ||xall. < d}. It 
follows from the inequality above that if x; € X, and zo, ve € X» then 


|9(@1, #2) — j(@1,2)]] = [|r(e1, 72) — r(w1, 2%)]2 S 3 [lea — 29Il9- 


It now follows from the Lipschitz implicit function theorem (Theorem 
14.6.4) that there exists a unique continuous function w : X; > Y such that 
j(1, 0(41)) = (a1) for 2, € Xy. Thus k(x1,¥(21)) = 0 for x; € Xy, and 
w is the unique function with this property. 
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Next we show that w is differentiable at 0, with derivative —T. If x1 © Xq, 
let s(a@1) = r(#1, w(a1)). Then 


(x1) = —T (a1) — s(21). 
Thus 
e(ello < ITM lerlh + 2Clealls + lleva) = C72) ler + 3 elle; 
hence ||¢(21)|p < K ||x1||,, and ||(w1, H(a1))|| < (AK +1) ||x1|,. Consequently 
s(x1)/ ||21||, + 0 as 2 > 0, and ~ is differentiable at 0, with derivative —T. 


Finally, it follows that if we set 


N = X, +a, and ¢(z) = a2 + U(x — aj) for rE N, 


then N and ¢ satisfy the requirements of the theorem. 


17.6 Higher derivatives 


Suppose that f : U > F is a mapping from an open subset U of a normed 
space (F, ||.||,;) into a normed space (F;,||.||,-) which is differentiable on U. 
Then Df is a mapping from U into L(E, F'). We consider the case where 
the mapping Df is differentiable at a point a of U. If Df is differentiable at 
a € U then we denote its derivative by D(Df)q, and say that f is twice differ- 
entiable at a. The linear operator D(Df),q is an element of L(E, L(E, F)); if 
he E then D(Df)a(h) € L(E, F). Thus if k € EF then (D(Df)a(h))(k) € F. 
We have seen (Exercise 14.3.2) that there is a natural isometric isomorphism 
j of L(E, L(E, F’)) onto the normed space B(E, E; F’) of continuous bilinear 
mappings from E x E into F’. We denote the bilinear mapping j(D(Df).) by 
D? f,. Thus D? fa(h,k) = (D(Df)a(h))(k). The mapping D? f, is the second 
derivative of f at a. 


Example 17.6.1 The second derivative of the mapping J : U > U7! 
from GL(E) to itself. 


Suppose that U is in the general linear group GL(E) of a Banach space 
E. Since GL(E) is an open subset of L(E), there exists 6 > 0 such that 
Ns(U) C GL(E). Suppose that 0 < ||T'|| < 6, and suppose that S € L(E). 
Then 


Dipag(S) =DIp(S)=-U 47) Ss +7) teu so. 
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Since (U + T)-1 =U-1 —U TU! +r(T), where r(T) = o(||T|), 


DJyu+r(S) — DJu(S) = 
Bt a0 ea eo a), 


where sg(T) = —r(T)S(U + T)-! — (U+ T)~!Sr(T) = o(||T||). Now 


Co st = 0 Te |e a 
Uo ru srt es er a se 


But U-!'TU-!SU~!TU~! = o(||T||), so that J is twice differentiable, and 
PSH fo “oo oe se re. 

This second derivative is symmetric in S and T. This is an important 
general property. 


Theorem 17.6.2 Suppose that f : U + F is a mapping from an open 
subset U of a normed space (E,||.||,,) into a normed space (F,_|].||-) which 
is differentiable on U, and twice differentiable at a € U. Then D? fa(h,k) = 
D? f4(k,h) for all (h,k) CE Xx E. 


Proof Before beginning the proof, let us see why this is a result that we 
should expect. For small h, the difference Af,(h) = f(a+h)— f(a) is a good 
approximation to Df,(h), and so for small h and k the second difference 
A? fa(h,k) = A(Afa(h))(k) is a good approximation to D? f,(h,k). But 


A? falh,k) = fla+ht+k)— f(ath)— flat+k) + f(a) 
is symmetric in h and k, and so we can expect D?f,(h,k) to have the same 
property. As we shall see, the proof is quite complicated. 
Suppose that « > 0. There exists 6 > 0 such that Ns(a) C U and 
|| D fate — Dfa — (D(Dfa))(*)|| < € lle lle, for all zg < 4. 
That is, 


|| Dfare(y) — Dfaly) — (D*fa)(#,¥))||p < €llelle llylle» 


for ||z||,~; < 6 and y € EL. First suppose that ||Al| 7 < 6/4 and ||kI|,, < 6/4. 
Let 


g(t) = Afasrim(k) = f(at+th+k) — f(a+th), for t € (—2,2), 
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so that g is a differentiable mapping from (—2,2) to F’. By the chain rule, 
J (t) = Dfattnen(h) — Dfattn(h). Since 


|D fatinge(h) — Dfa(h) — D? fa(th +k, h) Ne <ellth+ kllp |lAlle 
and ||Dfasin(h) — Dfalh) — D?falth, || p <€llthlle lle: 
it follows from the triangle inequality that if —1<t< 1 then 
|| fartnek — Dfarth — D? falk, h)|| p< 2e(Allg + Wl) llAlle 
that is, 
Ig (t) — D? falk, h)|| oS 2€(NAlle + lle) Wille. (*) 
Thus 
I|g'() - (|p < II9' (4) — D? falk, h)|| p + \|9' (0) — D? falk, h) Ml 
< 4e(|[Alle + llAlle) Alle - 


Applying the mean-value inequality of Corollary 17.2.6, 


I|9(2) — 9(0) — 9'(0)|| » < sup{||g’(t) — 9 ()||,:0<t <b 
< 4e(llAlle + |Allz) Alle. 


and so 
|9(1) — 9(0) — D? Falk, h)|| , 
< |[9(2) — 9(0) — 9'(0)]| p+ |]9’@ — D?falk, A)]] p 
< 6¢((lPll p+ All) Wl 

by (*). But 


g(1) — g(0) = f(ath+k)— flath) — f(a+k) + f(a) =A’ fa(h,k), 


and this is symmetric in h and k. Exchanging h and k, we see that 


l19(2) — 9(0) — D®fa(h, k)|| » < supf||g'(t) — 9/(0)|| .0<t< 1} 
< 6€(\All + lll) lle 
so that ||D2fu(k, h) — D?fa(h, k)|| p < 6e((lAlly + Ill 2)?. 


So far, we have only proved this inequality for small h and k. The following 
simple scaling argument shows that it holds in general. Suppose that h and 
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k are arbitrary members of E. There exists A > 0 such that ||AAl| 7 < 6/4 
and ||Ak|| 7 < 6/4. Then 


||? fa(k, h) — D* fa(h, k)|| » = ||D? fa(Ak, Ah) — D? fa(Ah, Ak) || po /A” 
< 6¢(||Mlle + AKL)? 
= 6e(|lAlle + WF lle)”: 


But ¢ is arbitrary, and so D? f,(h,k) = D? fa(k, h). 


Note that if f is twice differentiable at a then D?f(h,k) = Dp(Def)a; 
where D, and D, are directional derivatives in the directions h and k respec- 
tively. Thus D;(Dzf)a = Dy(Dpfa). In particular, if E = R4 and f is twice 
differentiable at a then 


Ox; \ Ox; Ox,0x; 
so that 
d d af 
; * 
ek De doh 1Fa,0n, ) 
i=1 g=1 
where cy e 
_ 25 o2 5 
Ox;0%; (@) Ox; 0X; (a) forlsijsd 


The results that we have established depend in an essential way upon the 
fact that f is twice differentiable at a. The existence of second directional 
and partial derivatives does not imply the symmetry result of the theorem. 
Let : 2) 
_ — ryt —Y 
f(0,0) =0, and f(a,y) = ay oe 


The reader should verify the following: 


for (x,y) # (0,0). 


e f is continuous and differentiable at every point of R?; 

e Df is continuous and differentiable at every point of R? \ {(0,0)}; 
=o 0,0) = - °F 0,0) =-1 
0x1 0x9 , 0x202X4 : ; 


There are further examples of bad behaviour, and rather specialized posi- 


tive results, some of which are included in the exercises, but we shall not 
investigate this further. 

We can also consider higher derivatives. Suppose that f is a mapping from 
an open subset U of a normed space (£, ||.||,,) into a normed space (F;, ||.|| -) 
which is (& — 1)-times differentiable on U and is k-times differentiable at 
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a. Let M;,(E,F') denote the space of continuous k-linear mappings from 
E* into F. Then we can consider D* f, as a k-linear mapping from E* 
into F: D(D*-1f), € L(E, My_1(E, F)), and D(D*-'f,)(h1)(ha,..., he) = 
D¥ fa(hi,..., hp). A function which is k-times continuously differentiable is 
called a C)-function. A function which is infinitely differentiable (a C*)- 
function, for each k € N) is called a smooth function. 


Theorem 17.6.3 Suppose that U is an open subset of a normed space 
(E,||.||~), that f is a C-function defined on U, taking values in a normed 
space (F’,||.||-), that V is an open set of F containing f(U) and that g is a 
CO) function defined on V, taking values in a normed space (G,||.\|¢). Then 
the function go f is a C)-function on U. 


Proof The proof is by induction on k. The result is true for k = 1, by 
Corollary 17.1.3. Suppose that it holds for k — 1 and that f and g are 
C*)-functions. The function « + Df, is a C*~1)-function on U, and, by 
the inductive hypothesis, the function x + Dgyz) is a C-)-function on 
U. By the inductive hypothesis again, the function x + Dgp(z) ° Dfx is a 


C‘*-))_function on U: that is to say, go f is a C)-function on U. 
Corollary 17.6.4 If f and g are smooth, then so is go f. 


Corollary 17.6.5 The inversion mapping J : GL(E) > GL(E) is a 
smooth mapping. 


Proof We need a preliminary lemma. 


Lemma 17.6.6 Suppose that (E, ||.||,,) is a normed space. Let B : L(E) > 
L(E) be defined as B(S)(T) = —STS. Then B is a smooth function. 


Proof For (DBs(H))(T) = —HTS — STH, and 


(D*Bs(H, K))(T) = —HTK — KTH, 


so that D?B = 0. 


We now prove the corollary. The proof is by induction on k. The map- 
ping J is continuously differentiable. Suppose that it is a C‘’—))-function. 
Since DJ = Bo J, the derivative DJ is a C\*-)-function; that is, J is a 
C“*)_-function. 


We also have the following result. 


Theorem 17.6.7 Suppose that f : W > F is a diffeomorphism from 
an open subset W of a Banach space (E,|\.||,,) onto a subset f(W) of a 
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Banach space (F,||.||,2) , and that f is a C)-function. Then f~' is also a 
C*) function. 


Proof Since (£,]|].||,~) and (F;||.||-) are isomorphic Banach spaces, we 
can suppose that EF = F. The proof is again by induction on k. Suppose 
that the result holds for k — 1, and that f is a C)-function. Then the 
mapping f—!: f(W) > W isa C—-function, by hypothesis, the function 
Df :U + GL(E) is a C%~)-function, and the inversion function J : 
GL(E) > GL(E) is a smooth function, and so, applying Theorem 17.6.3, it 
follows that the mapping y > OF op)” is a C(*—-)-function. Thus f~! 
is a C\)-function. 


A diffeomorphism which is a C‘*)-function is called a C\*)-diffeomorphism, 
and a diffeomorphism which is a smooth function is called a smooth 
diffeomorphism. 

We have the following symmetry result. 


Theorem 17.6.8 Suppose that f is a mapping from an open subset 
U of a normed space (E,||.||,z) into a normed space (F;,||.||-) which is 
(k —1)-times differentiable on U and is k-times differentiable at a. If o is a 
permutation of {1,...,k}, then D* f,(h1,...,hp) = D¥ (ho(1); ag ligtin): 


Proof The proof is by induction on k. It is trivially true if k = 1, and it is 
true when k = 2, by Theorem 17.6.2. Suppose that it is true for 7 < k, and 
that f is (k — 1)-times differentiable on U and is k-times differentiable at 
a. Let G be the set of permutations of {1,...,k} for which equality holds. 
Then G is a subgroup of the group }, of permutations of {1,...,k}. Let 
H = {o € dy : o(1) = I}, and let 7;,; denote the permutation which 
transposes 7 and 7. If 0 © H then by the inductive hypothesis 
D¥ fa(hi,..., he) = D(D*" fa(ha,..-,he))(h1) 
= D(D*™ fa(ho(2); fina hk) ))(Re(1)) 
= D¥ (he), sciice he(k))s 
so that H C G. In particular, 7;,; € G if neither 7 nor 7 is equal to 1. On the 
other hand, by Theorem 17.6.2, 
D¥ fa(hi,...,he) = D?(D* fa(hs,...,he)) (Ai, he) 
= D?(D*~? fa(hs,..-,he)) (he, hi) 
= DPT hs. hi, hg, tee hk), 
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so that 71.2 € G. Thus 71; = 72,;71,272,; € G. Since any permutation can be 
written as a product of such transpositions (Exercise 11.7.1), it follows that 
G = Xp. 


If h € E, let h? = (h,...,h) € E’. Since the directional derivative of f 
in the direction h is Df,(h), it follows that the jth directional derivative in 
the direction h is D/ f,(h’). 

We can also establish a version of Taylor’s theorem. We prove it for Hilbert 
spaces. A corresponding result holds for Banach spaces, but this needs the 
Hahn—Banach theorem, whose proof is beyond the scope of this book 


Theorem 17.6.9 Suppose that f is a k-times differentiable mapping from 
an open subset U of a normed space (E, ||.||;;) into a Hilbert space (F; ||.||-), 
and suppose that the segment [a,a +h] is contained in U. Suppose that 
sup{||D* fa+tn| Mt lH iM <o0, Then 


lath) = fla) + ED sy, 


j=l 
where |[re(h)|| » <M |All /k!. 


Proof We reduce this to the scalar result. If r,(h) = 0, there is nothing to 
prove. Otherwise, there exists an open interval J in R containing [0, 1] such 
that a+th € U for t € I. Let 6 = rz(h)/ ||rx(A)|| p, so that ||¢|| ; = 1 and 
(ra(h), 4) = [Ire (Pll 

Then the mapping g : J > R defined by g(t) = (f(a + th), d) is k-times 
differentiable. By the chain rule, (d’g/dx!)(t) = (D! fa+tin(h’), ¢), and so by 
Taylor’s theorem 


k; 
Vornasei)= +p Came Prat ine uel 


for some 0 < ec < 1. Thus 


re(h)[l > = (re(h), 8) = (D¥ fasen(h®) /hl, 6) 


e [PM foc Ilalle . M lV 
k! a tae 


17.6 Higher derivatives 511 


Let us consider the case where E = R%. Then the jth term in the Taylor 
expansion is 


5D fal hi) 


sie 


d d 9 9 
ee ae: hi, Oxi, oe NY 


Using the symmetry established in Theorem 17.6.9 and gathering terms 
together, we see that 


é * . q a 
Fit ae Dd! Ox}? ax 


1. hh. nt Oh Ba 
SDI) = 


jitetja=i 


Exercises 


17.6.1 There exists a continuous ou. a g on R which is not 
differentiable at any point. Let h(x = fr ol t) dt. Use h to obtain a 
continuous function f on R? such a 
(a) f is continuously differentiable at every point of R?; 

(b) Of /Ox2 is continuously differentiable at every point of R?; 
(c) 0? f /Ox1 0x2 exists at every point of R?, and is continuous on R?; 
(d) 0? f /Ox20x, does not exist at any point of R?. 

17.6.2 Suppose that f is a real-valued function defined on an open subset 
U of R?, and that Of /Ox, and 0? f /Ox20z, exist at every point of 
U. Suppose that the closed rectangle R = [a,a +h] x [b,b +k] is 
contained in U. By considering the function g(t) = Afia+tnp)((0, &)) 
and applying the mean-value theorem twice, show that there exists 
(u,v) in the interior of R for which 


2, 
hk— (u,v) = A2f,((h,0), (0,&)) 


0x202X4 
= f(a+th,b+k)— f(at+h,b) — f(a,b+k) + f(a,d). 


17.6.3 Suppose that f is a real-valued function on an open subset U of R?, 
and that Of /O21, Of /Ox2 and 0? f /Ox20x, exist at every point of 
U, and that 0? f /Ox20x, is continuous at (a,b). Suppose that € > 0. 
Show the following. 
(a) There exists 6 > 0 such that N5(a,b) C U and 


A? fia.) ((h, 0), (0, k)) _ Of 
hk O0x20X, 


(a,b)| <e, 


for 0 < |h| < 6/2, 0 < |k| < 6/2. 
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(b) If 0 < |h| <6 then 
1 / Of of O° f 
i; ac + h, b) = Bas ») = Ax 202} (a, b) 
(c) 0? f /Ox10x2(a,b) exists, and is equal to 0? f /Ox20x1(a, b). 


17.6.4 Suppose that (£,||.||,~) is a Banach space. Show that if J is the 
mapping U + U~! of GL(E) to GL(E£), then 


Diet) aly YOR a Ta 
oedy 


18 


Integrating functions of several variables 


18.1 Elementary vector-valued integrals 


We now consider the problem of integrating vector-valued functions of 
several variables. We begin by considering dissections, step functions and 
elementary integrals, as in the case of real-valued functions of a single vari- 
able. A cell C in R¢ is a subset of R4 of the form Ij x---x Ig, where ),..., Iy 
are intervals (open, closed, or neither) in R. Thus a one-dimensional cell is 
an interval and a two-dimensional cell is a rectangle. The d-dimensional 
volume or content vg(C) is defined to be vq(C) = Ne = Ly). 

Suppose that C = I, x --: x Ig is a compact cell, and suppose that 
D; = {aj = 20 < 431 < +++ < 2%, = bj} is a dissection of Jj, for 
1 <j <d, with constituent intervals Jj1,...,Ij,4,- Then D =D, x---x Dg 
is a dissection of C’. The collection of cells 


Adie Ree eT tli a hel ey Ra} 


is then the set of constituent cells of the dissection D. We list them as 
Ci,...,Ck, where k = k, ...kg, and we denote the indicator function of C; 
by xj. The mesh size 6(D) of a dissection D is the maximum diameter of a 
constituent cell. 

We order the dissections of C by inclusion: we say that D’ refines D, and 
write D < D", if Dj refines Dj, for 1 < j < d, This is a partial order on the 
set A of all dissections of C, and A is a lattice: 


DY D'=(Di UD) * << (DyU DA) and DAD = DAD’. 
A has a least element, with one cell {C}, but has no greatest element. 
Suppose now that C’ is a compact cell in R%, that D is a dissection of C, 


with constituent cells C1,...,C,, and that (F’||.|]) is a Banach space. We 
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denote by S'7(C, D) the set of all F-valued functions of the form 
2) = So) where f; € F forl<j<k. 


The elements of Sp(C,D) are F-valued functions on C. 

We define the elementary integral ail x)dx of a step function f = 
ae fx; in Sr(C, D) to be ea ¢ FpvalG;): - in the real-valued case it is 
necessary, and straightforward, to show that this is well-defined. 


Proposition 18.1.1 Suppose that f and g are step functions and that c 
is a scalar. sai se +g and cf are step eee and 


() Tat l )) dx a mde [ag(z) 
(it) eG i ae xe) dz. 


Proof ‘The proofs are the same as the proofs of the corresponding results 


in the real-valued one-dimensional case. 


Thus the set S7(C) of F-valued step functions on C is a linear subspace 
of the space Br(C) of all bounded F-valued functions on C. 

If f is an F-valued step functions on C then ||f|| is a real-valued step 
function on C. 


Proposition 18.1.2 Suppose that f € Sr(C). Then 


| [rea < fs ae 


The function i | f(x)|| dx is a norm on Sr(C). 


Proof By the triangle inequality, 


f(x) dx < Eile Ue || de. 
C 


Clearly fc |lef(x)|| da = lel fo |f(w)|| dx, and fo ||f(«)|| dx = 0 if and only 
if f = 0. If f,g € Sr(C) there exists a dissection D such that 


k k 
f= 0 fix; and g = 97 95x;- 
j=l j=l 
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Then 


k 
[ite +00 (de = S15 + alc < Sf + lglDva(C) 
j=1 


J 
_ fe ast )I| de. 


Exercise 


18.1.1 Show that the elementary integral of a step function is well-defined. 


18.2 Integrating functions of several variables 


We now consider the Riemann integral of a real-valued function of sev- 
eral variables. We follow the procedure for integrating functions of a single 
variable very closely, and we therefore omit many of the details. 

Suppose that f is a bounded real-valued function defined on a compact 
cell C in R2. We define the upper and lower integrals of f: 


fie) az = init {f g(x) dx : g a step function, g > r 
C Cc 


[@ dx = sup f h(a) dx : h a step function, h < r} : 
Jc C 


The function f is Riemann integrable if the upper and lower integrals are 
equal; if so, the common value is the Riemann integral of f. 

Recall that if f is a function on a set S taking values in a metric space 
(X,d), and A is a subset of X then the oscillation Q(f, A) of f on A is defined 
to be sup{d(f(a), f(b)) : a,b € A}; when f is real-valued then Q(f, A) = 
sup{|f(a) — f(b)| : a,b € A}, and when f takes values in a Banach space 
(F; |[-[|) then Q(f, A) = sup{||f(@) — f(®)|| : a,b © A}. 


Theorem 18.2.1 Suppose that f is a bounded real-valued function on a 
compact cell C in R4. The following are equivalent. 

(i) f is Riemann integrable. 

(ii) Whenever € > 0 there exists a dissection D of C with constituent cells 
Cl,...,C, such that 
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Proof Suppose that f is Riemann integrable and that « > 0. There exist 
step functions g and h on C such that h < f < g and 


| g(x) dx -f f(x) dx < €/2 and | f(a)dx -f h(a) dx < €/2, 
Cc Cc Cc Cc 
so that {.(g(x) — h(x)) da < €. Let D be a dissection of C, with constituent 


cells Cj,...,C;,, such that g and h are constant on each Cj. Then Q(f,C;) < 
g(a) — h(a) for x € Cj for 1 <j <k, and so 


k 
YF, C5)0alC5) < f (ole) - h(a) de <e 

j=l - 

Thus (i) implies (ii). 
Conversely, if D is a dissection of C with constituent cells C,,...,C, for 
which S7¥*_, Q(f,C;)va(C;) <e, let 
k k 
g= S¢ sup{f(2) (2 eC; xy; and A= S "inf { f(z) ce ECr yas 
j=l j=l 
Then g and fare step functions with h < f < g, and 
k 
f g(a) dx -{ h(x) dx = S° O(f, Cy)valCj) < €, 
Cc Cc F 
j=1 


so that (ii) implies (i). 


Corollary 18.2.2 If f is a continuous real-valued function on a compact 
cell C in R4, then f is Riemann integrable. 


Proof Suppose that € > 0. Since f is continuous and C is compact, f is 
uniformly continuous on C, and so there exists 6 > 0 such that if ||” — y|| <6 
then | f(x) — f(y)| < €/uvg(C). Let D be any dissection of C with mesh-size 
less than 6. If c; is any constituent cell, then Q(f,Cj) < €/va(C), and so 


€ 


k 
OCF, Cj)va(C5) < ual) 2 r(Cs) =e. 


j=l 


Similarly, we have the following elementary results. The proofs are the 
same as the proofs for real-valued functions of a single variable, and details 
are left to the reader. 
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Theorem 18.2.3 Suppose that f and g are Riemann integrable functions 
on a compact cell C in R¢. 
(i) Ifc ER then f +g andcf are Riemann integrable, and 


[ (f(x) + 9(x)) de = [ f(x) de + [ (=) dx, 


cf(x)de=c |] f(«)de. 
Cc C 


(ii) If f(x) < g(x) for alla € C then Jo f(x) dx < Jo g(a) dx. 

(iii) If f takes values in |—R, R] and ¢ is a continuous real-valued function 
on |—R, R| then do f is Riemann integrable. 

(iv) The functions f+, f-, |f|, f? and fg are Riemann integrable. 

(v) |Jo Fla) de] < fo |F(@)| de. 


Exercise 


18.2.1 Suppose that f is a real-valued function on [0,1] x [0,1], that the 
mappings xz —+ f(x,y) are increasing for each y € [0,1], and that the 
mappings y > f(x,y) are increasing for each x € [0,1]. Show that 
f is Riemann integrable. Extend this result to functions on compact 
cells in R42. 


18.3 Integrating vector-valued functions 


We now consider the problem of integrating vector-valued functions defined 
on a compact cell in R%. Suppose that f : C > (F;,||.||) is a function on a 
compact cell C' in R@ taking values in a bounded subset of a Banach space 
(F, ||.||). Since in general there is no order on F’, we cannot use upper and 
lower integrals to determine when f is Riemann integrable, and to define the 
Riemann integral. Instead, we start with the characterization of Riemann 
integrability given in Theorem 18.2.1 (i). We say that f is Riemann integrable 
if, whenever € > 0, there exists a dissection D of C, with constituent cells 
Chyiccep Op; Such that 4 ug Cs )QUF; Cy) << 

Before defining the integral, we need to establish some fundamental 
properties of Riemann integrable functions. 


Theorem 18.3.1 Suppose that f is a function on a compact cell C in R4 
taking values in a bounded subset of a Banach space F. The following are 
equivalent. 
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(i) The function f is Riemann integrable 


(it) Whenever € > 0 there exists a dissection D of C with constituent cells 
Ci,...,Cy and a partition GU B of {1,...,k} such that 


OQ(f,C;) <€ forg eG, and S> va(C; 


)<e : 
jEB 


(itt) The real-valued function || f — g|| is Riemann integrable, for each step 
function. If ¢ > 0, then there exists a step function g on C' for which 


inf {[ | f(x) — g(x)|| dx : g a step function | =(). 
C 


Suppose that (7) holds and that « > 0. There exists a dissection D 
of C with constituent cells C1,...,C, such that 


Proof 


Let G={7 2 OUf,C;) <<} and let B= {9s OV, C;) > €}.. Then 


k 
e{ 5 va(Ci)} < S52 Ci) <e, 
jcB j=l 

so that (ii) holds. 


Suppose that (ii) is satisfied, and that « > 0. Let 


n= €/(va(C) + Q(f, C)). 


There exists a dissection D of C with constituent cells C1, 


...,C such that 
the condition holds, for 7. Then 
k 
Y_ 2(F, Ci)va(Cs) 
j=l 


SAF, Cj)va(Cy) + 2 OF, Ci val 
jEB 


Cj) 
< (sup Q(/,€ §)) Do va(Cz) + QF, C) D7 va(C}) 


jEG 
< nva(C) + nQ(f, C) = e. 


Thus (ii) implies (i). 


jEB 
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Suppose that f is Riemann integrable, that g is a step function and that 
€ > 0. There exists a dissection D of C’, with constituent cells Cy,...,Czx, 
for which Sy va(Cj)Q(f, Cj) < €, and for which g is constant on each Cj. 
If x,y € C; then 


NF (@) — g(@)l — IF) — oI S IF (@) -— FOIL, 


so that 2(||f — gll ,Cj) $ Q(f, Cj). Hence 75_, va(Cj)Q(IIf — gll Cy) < ¢, 
and so ||f — g|| is Riemann integrable, by Theorem 18.2.1 (i). Now choose 
yj € Cj, for 1<j<k, and let g be the step function g = ae f (yj)xj. Then 
lf (x) — g(x)|| < QC(F,C;) for x € Cj, so that 


k 
[ F(x) — (2) ae| 


jal 


Thus (i) implies (iii). 

Conversely, suppose that (iii) holds, that « > 0 and that g is a step 
function for which J. || f(x) — g(x)|| < €/2. There exists a dissection D of 
C,, with constituent cells C,,...,C,, for which 


k 
va(C5) (on I F(a) =) < €/2, 
=1 


J 


and for which g is constant on each C;. If y, z € C; then 
IIF(y) — Flz)I| < 2 [se Il f() zo) 


so that yi va(Cj)Q(f,Cj) < €. Thus f is Riemann integrable, and (iii) 
implies (i). 


We are now ready to define the Riemann integral of a Riemann integrable 
function. Suppose that f is a Riemann integrable function on a compact cell 
C’.. For each ¢€ > 0, let 


Age {9 pace ntionesG i I F(w) — g(a) || de < cS | 
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and let Je = {fo g(a) dx: g € A-(f)}. Then A.(f) is non-empty, and J; is a 
non-empty subset of F’. If g,h € A-(f) then 


ate eNO 


a g(x) — A(a)|| dx 
< [ I F(x) — 9(x)|| de + [ise I F() — h(a) || dx < 26. 


Thus J, has diameter at most 2, and so therefore has its closure J,. It 
now follows from Corollary 14.1.12 that the intersection N{J.:¢ > Obisa 
singleton set {I}. We define I to be the Riemann integral [., f(x) dx of f. 
Note that if g € A.(f), then 


[: : [a de 


Corollary 18.3.2 Jf f is a Riemann integrable function on C then 
there exists a sequence (fp)°2, of step functions on C for which 
Jollf(2) — fr(x)|| dx > 0 as n > oo, and, for any such sequence, 
fade des fof x)dx as n—- oo. 


< diam (J.) < 2e. 


Proof Pick fn € Aijn- 


We have the following fundamental inequality. 


Theorem 18.3.3 (The mean-value inequality for integrals) Suppose that 
f is a Riemann integrable function on a compact cell taking values in a 


Banach space (F,||.||). Then 
< [ \rt@)ll ax. 
C 


[Loo 


Proof Suppose that ¢ > 0. Then there exists a step function g such that 
Jollf(z) — g(2)|| da < €/3. By the remark above, 


nde — f g(2)de 


< 2€/3. 
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Thus, applying Proposition 18.1.2, 


[ f(x) de [ g(«) de 


<[ g(x)|| dx + 2€/3 


< + 2€/3 


< ; f(a)|| de + [ I F(2) — g(a) || de + 2€/3 


<[ f(x)|| dx +e. 


Since € is arbitrary, the result follows. 


Corollary 18.3.2 enables us to establish standard properties of the 
Riemann integral. 


Proposition 18.3.4 Suppose that f and g are Riemann integrable func- 
tions on a compact cell C taking values in a Banach space (F,||.||), that h 
is a real-valued Riemann integrable function on C and thata€ R. 

(i) The functions f +g andaf are Riemann integrable and 


[te)+atw)ae= f slayae+ f ale) ae. [ote=o f sae. 


(ii) The function hf is Riemann integrable. 

(iit) Suppose that @: f(C) > (G,||.\|@) is a uniformly continuous map- 
ping from the image f(C) of C into a Banach space G. Then dof is Riemann 
integrable. 


Proof That f +g and Af are Riemann integrable follows from the facts 
(which the reader should verify) that 


O(f + 9,4) < QF, A) + Q(g, A) 
and Q(hf, A) < 2(h, A) Il flloo + IIlloo OCF; A); 


and the definition. 
There exist sequences (f)°2 and (gn)°2, of step functions such that 
Jollf(@) — fn(x)|| dz > 0 and fo ||9(@) — gn(a)|| dx + 0 as n > o0. Then 


), ieee) Hote) =e) Laer = 
| F(e) — fa(a)|| dx + [ Ig(z) — gn(x)|| de, 
C C 
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so that 


[ive G tt)) — (fal) + gn())|| dx +0 as n + 00, 


; fe) Autar= [ aan. [ gee ae. 


The proof of the result for scalar multiplication is even easier. 

(iii) Suppose that € > 0. There exists 6 > 0 such that if || f(x) — f(y)|lp < 
6, then ||(f(x)) — o(f(y))lle < €. By Theorem 18.3.1, there exists a dis- 
section D of C with constituent cells C,,...,C, and a partition GU B of 
{1,...,&} such that 


and so 


Q(f,C;) < 6 for 7 EG, and S~ va(C; \<e. 
jEB 


Then Q(¢0 f, C;) < € for 7 € G, so that, by Theorem 18.3.1, go f is Riemann 
integrable. 


The uniform limit of Riemann integrable functions is Riemann integrable. 


Theorem 18.3.5 Suppose that (fn)°, is a sequence of Riemann inte- 
grable functions on a compact cell C, taking values in a Banach space 
(F,||.||), which converges uniformly to f. Then f is Riemann integrable, 
and 


| fn(z) dz > / f(x) dx as n > ov. 
Cc C 
Proof Suppose that ¢ > 0. There exists N such that 


If — frlloo = suptllf(®) — fn(x)|| + « € C} < €/4v4(C) 


for n > N, and there exists a dissection D of C with constituent cells 
Ci,...,Ck such that >F_,va(Cj)Q(fn,C;) < ¢/2. Then O(f,C;) < 
O(fn,C +2 (|F- Poe that 


k k 
N° va(Cj) QF SD val O(fn, Cj) + 2\|f — frlloo) 
= =i 


< €/2 + 2va(C) |If — frilloo < € 


Thus f is Riemann integrable. 
Suppose now that n > N. Then 


[te )ae— [fale ae 


<[ F(2) — fala)|| dx < €/4, 
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so that 
[ fn(z) dx > | f(z) dz as n> oo. 
C Cc 


Riemann integrability can also be characterized, and the integral calcu- 
lated, using dissections with decreasing mesh size. See Exercise 1. 

So far, then, everything appears to be very straightforward. In fact, there 
are very real technical difficulties. These arise in the following circumstances. 

(i) We frequently wish to integrate functions over more general bounded 
subsets of R@ than cells. 

(ii) We would like to evaluate integrals by repeatedly calculating one- 
dimensional integrals. For example, if f is a Riemann integrable function on 
Co = [0,1] x [0,1] can we calculate 


i 1 
| f (x1, 22) dx as a repeated integral [ (/ F (@1, 29) aa) dx? 
Ce 0 0 


(iii) Can we establish a ‘change of variables’ formula of general applica- 
bility? If U and V are bounded open sets, and ¢: U — V is a continuously 
differentiable homeomorphism, can we show that 


/ f(w) de = / f(oly))Jolyldy ? 
Vv ie 


As far as (i) is concerned, a bounded subset A of a compact cell C' is 
said to be Jordan measurable if its characteristic function J4 is Riemann 
integrable. If so, the Riemann integral of [4 is called the Jordan content 
or volume of A, and is denoted by vq(A), or v(A). Clearly a cell is Jordan 
measurable, and the definition of its content as an integral agrees with the 
definition at the beginning of this section. On the other hand, we have seen 
that the indicator function of a fat Cantor set is not Riemann integrable, 
and so a fat Cantor set is not Jordan measurable. 

If A is a Jordan measurable subset of a compact cell C, with indicator 
function I,4, and f is a Riemann integrable function on C, then it follows 
from Proposition 18.3.4 (ii) that fZ,4 is Riemann integrable; we define 


[re dz = [feos dx. 
As for (ii), let 


A= {(z,y) € [0,1] x [0,1] : y is rational if c = 1/2}. 
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Then A is a Jordan measurable subset of [0,1] x [0,1], but ft T4(1/2, y) dy 
is not defined. 

As for (iii), we shall see in Section 18.4 that it can happen that U is 
Jordan measurable, but V is not. 

These difficulties suggest that we need a more sophisticated theory of inte- 
gration, and the Lebesgue integral, which is studied in Volume III, provides 
such a theory. The Riemann integral is however adequate for the integra- 
tion of continuous functions. Further, the change of variables results that 
we obtain in Section 18.7 are essential for corresponding change of variable 
results in the Lebesgue integral setting. 

As with functions of a scalar variable, we can define improper integrals, 
but some care is needed; for example, if f is defined on R%, it is natural to 
consider limits such as 


lim f(x) dx and lim J (2) de; 

R400 jell, <R Ro J'lal|_<R 
and it is relatively easy to give examples where the limits exist and are 
different. Similar remarks apply to Cauchy principal value integrals. In each 
case, it is necessary to make explicit the limiting procedure that is used. 


Exercises 


18.3.1 Suppose that f is a function on a compact cell C in R4 taking values 
in a bounded subset of a Banach space (F’, ||.||). Suppose that (D,)°°, 
is a sequence of dissections of C’ whose mesh-sizes tend to 0, and that 
Cr1,.-.,C,,%, are the constituent cells of D,. Show that f is Riemann 
integrable if and only if there exists J ¢ F' such that if y,; € C,; for 
1<j<k, andr eéN then 


ky. 
fe va G, ny) ed Ge > oo. 
j=l 


[Hint: If D’ is a dissection of C’, with constituent cells Ci,...,C;, 
let T;. be the set of constituent cells of D, which are not contained in 
one of the cells Ch. Show that 


S “{va(C, \ iC, ge Ty 0 as 66, 


18.3.2 [The fundamental theorem of calculus for vector-valued 
functions] Suppose that f is a Riemann integrable function on [a,b], 
taking values in a Banach space (£, ||.||). 
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(i) Show that if a <c <b then f is Riemann integrable on [a,b] if 
and only if it is Riemann integrable on [a,c] and [c,}], and if so 
then ae dz = So flax) dx + f? f(a) dx 

(ii) Set F(t on f(x) dx, for a <t < b. Show that F is continuous 
on [a, : 

(iii) Show that If f is continuous at t then F is differentiable at t, and 
F'(t) = f(t). (ft =a or b, then F has a one-sided derivative.) 

(iv) Suppose that f is differentiable on [a,b] (with one-sided deriva- 
tives ie a hee b). Show that if f’ is Riemann integrable then 
f(x) = f(a) + f f(t) dt fora<a<b. 

18.3.3 Suppose oe ‘ and g are che integrable functions on a cell C' in 
R‘, taking values in L(E), where (£, ||.||) is a Banach space and L(F) 
is given the operator norm. Show that fog and go f are Riemann 
integrable. Is [, f(x) 0 g(x) dx = fu g(x) °o f(x) dx? 

18.3.4 Construct a continuous real valued function f on R? for which 


lim flejdz=0 
R00 J||2||,<R 


and for which 


lim f(x) dx does not exist. 
Roce }la||,.<R 


18.4 Repeated integration 


To begin with, let us consider a continuous function f defined on a compact 
cell C = I, x --- x Ig in R%, taking values in a Banach space (F,||.||). Let 
C=Ihx--x Ig_,, so that C = C x Ig. Denote a Polit B= (lis .n2g Hg) a 
R? by (Z,t), where ¥ = (x1,...,%g_1) and t = 2q. 


Theorem 18.4.1 Let f be a continuous function from a compact cell C = 
Ty Xsoee Ty in D into a care space (F,||.||). With the notation above, 
if © E Cy. 1 let d(& =f,,f (x,t) dt. Then @ is a continuous function on Cc 


and 
Fro [af | 9(@.t) a) di. 


Proof We use the fact that f is uniformly continuous on C’. Given € > 0 
there exists 6 > 0 such that if z,y € C and d(x, y) < 6 then || f(x) — f(y)|| < 
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e/l(Ia). Thus if d(x, y) < 6 then || f(z,t) — f(y,t)|| < €/l(UZa), and so 
KG )- o@l=||f ra - 4 moe 
</ w@n-s eo doce 


Thus ¢ is continuous on C: 
Further, there exists a e function g on C' such that || f(x) — g(x)|| < 
€/2va(C) for « € C. Let W(x = Jr, 9 (Z,t) dt, for Z € C. Then 


x)de— fa) ya < [ lf(@)- 2)\| de < €/2, 
aid 
lo@- Fr )at| < [ ieee. Ol aes ey 
so that 


< €/2. 


| Loma [oma 


But fo g(x) dx = Je W(&) dz and so 


| [rae foaar 


Since € is arbitrary, the result follows. 


Corollary 18.4.2 


[t@a=f Ce | flays..-+t4) dea) ... de) dey. 


Thus we can evaluate the integral by repeatedly evaluating one dimen- 
sional integrals. Further, if o € ©, is a permutation of {1,...,d} then we 
could integrate with respect first to x,(q), then x,(q_1), and so on, and obtain 
the same result. 


Corollary 18.4.3 Ifo © Ngq then 


[ (f (ff ten. 28)dra) .- dee) i= 
i (= (fa ©1,--+;%q) dx va ae tsa dtg(1): 
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For example, if B(x, y) is the beta function i; t?-1(1 — t)¥—! dt then 


[ ([° 22.v)ar) y= [ ([ (fm a-or at) i) dy 
-[ ([ #140) ([ a-ora) dt 


ie 
do (log t) (log (1 — t)) 


We can also justify the ‘quick and easy’ proof of Theorem 9.4.4 of Volume 
I. The argument there now shows that if f and g J are continuous 27-periodic 
functions then the Fourier coefficients satisfy (f *g)n = fndn. This result 
extends easily to locally Riemann integrable 27-periodic functions f and g. 
Let M = max(||f||,,, ||g||,,). #0 < € < 1, there exist continuous 27-periodic 
functions f’ and g’ with 


20 


27 
/ f(t) — f(b) dt < €/4(M +1) and i f(t) — f(b) dt < «/4(M +1). 


Then 
tf saa — Uf *G))al < €/2 and |FaGn = Indul < 2, 


so that \(F*9)n — fndn| < €. Since € is arbitrary, the result follows. 

These results can be applied to continuous functions on R®@ of compact 
support: that is, functions which take the value 0 outside a bounded set. 
Frequently, though, we wish to consider improper integrals. As we have 
seen when we considered products of series, difficulties arise when conver- 
gence depends upon cancellation. For this reason, we restrict attention to 
the simplest case, where we integrate non-negative continuous real-valued 
functions. 

Suppose then that C = I, x--- x Iq is the product of open intervals (which 
may be semi-infinite, or infinite), and that f is a non-negative continuous 
real-valued function on C’. We then define the improper Riemann integral as 


| fe)de=sup} | f(x) dx: K a compact cell, K C ch. 
c K 


The resulting integral can then be finite or infinite. We continue with the 
notation introduced at the beginning of the section. 


Theorem 18.4.4 Suppose that f is a non-negative continuous real-valued 
function on C =I, x--- x Ig. For % € C, let ¢(%) = Sh f(x, t) dt. Suppose 
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that @(£) < oo for each TE C, and that o is a continuous function on on 


a [nerden [owas [ (ff tend) 


Proof If K = J, x--- x Jqg is a compact cell contained in C then 


[ tear= f ( | #@.Hat) w< f ( : f(e.t)at) P 
= | o@ar< [oe ae 


[ fees “Ss saya 


On the other hand, if M < Jae ~ (x) dz, there exists a compact cell K 
contained in C’ such that 


Consequently 


i. Ke IF. 


Let (L;)72, be an increasing a . compact intervals contained in Ig 
whose union is Ig, and let ¢;(%) = {, f i. (x,t) dt. Then each ¢; is a continuous 


function on K , and @; increases pointwise to the continuous function @. 
It therefore follows from Dini’s theorem (Theorem 15.2.12) that ¢; > @¢ 
uniformly on K, as 7 > oo, and so 


[oa [ow de ig eee: 
K K 
Thus there exists 7 € N such that 


[L, fede= fo @ae> Mw. 


Hence joa x) dx > M. Since this holds for all M < Se oe ) dx, it follows 


that 
[te@ace [ o@ar 
Cc Cc 
Thus we have equality. 


Thus [°(f?° B(x, y) dx) dy = fy 1/(log t log(1 — t)) dt. 
With a little care, this result can be used in cases where f is not positive. 
Let us give an example. 
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Example 18.4.5 {5° sina /xdzx = 7/2. 
Proof Suppose that K > 0. If « > 0 then 1/2 = i, e** dt, and so 


K o: K love) 
| a dz = | sin x ( ere it) dz. 
0 x 0 0 


Although sin is not a positive function, we can divide the interval [0, kK] 
into finitely many intervals, on each of which sin is either non-negative or 
non-positive, and apply the previous theorem to each of them. Thus 


K love) love) K 
| sin x (/ oe it) dz = | (/ eo sind) dt. 
0 0 0 0 


Integrating the inner integral by parts twice, 


K K 
| e-* sin x dx = [-e~* cos a|* -+f e™ cos x dr 
0 0 
K 
=1—¢*‘'cos K —tle™ sina — a e sina dx 
0 
K 
=1-—e-*'(cos K +tsin K) — ef e sina dz. 
0 
so that 
kK l 
~“" sin dx = ——> + R(t 
i eS sing dr = 75 + R(t), 
where Rx(t) = —e~**(cos K + tsin K)/(1 + t?). Since 
|cosK +tsinK|<1+t< 2(14 #7), 


|Rx(t)| < 2e~**, and so [5° Rx(t) dt > 0 as K > oo. Consequently 


[ a= lim we ae =f dt =o7 
0) x a ae 0 x 7 0 it? 2 


Exercises 


18.4.1 Suppose that f is a real-valued Riemann integrable function on a 
compact cell C in R¢ and that g is a Riemann integrable function on 
a compact cell D in R°. Show that the function (x, y) > f(x)g(y) is 
Riemann integrable on C' x D and that, with the obvious notation, 


[serrate = (f reerar) (f otwav) 
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18.4.2 Give an example of a sequence of continuous real-valued functions on 
[0, 1] which increases pointwise to a bounded function on [0,1] which 
is not Riemann integrable. 

18.4.3 Let 


P={(m/p,n/p) € [0,1] x [0,1] : p a prime, m,n € N}. 


Show that P is dense in [0,1] x [0,1], and that 


[ ([ tre. av) dt = [ (f Ip(x,y) ir) dy. 


Is Ip a Riemann integrable function on [0,1] x [0,1]? 


18.5 Jordan content 


We now investigate some of the properties of Jordan measurable subsets 
of R¢. 

Suppose that A is a subset of a compact cell C and that D is a dissection 
of C’, with constituent cells Cy,...,C,. We partition the cells of the partition 
into three: we set D = J(A) UK(A) UL(A), where 

K(A) = {Cj}: C; 1 AA and C;N(C \ A) FO} 
L(A) = {Cj : C; NA= p}. 
Thus J(A) is the set of cells contained in A, K(A) is the set of cells which 


contain points of A and points not in A, and L(A) is the set of cells disjoint 
from A. 


Theorem 18.5.1 A bounded subset A of a compact cell C in R4 is Jordan 
measurable if and only if given « > 0 there exists a dissection D of C, with 
constituent cells Cy,...,C,, such that 


S “{ua(Cy) : Cj € K(A)} <e. 
Let 


Ua(A) = inf va(C;) : D a dissection of C ? , 
C;€J(A)UK (A) 


vq(A) = sup va(C) : D a dissection of C 
C;€J(A) 
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Then A is Jordan measurable if and only if Ta(A) = ug(A). If so, then 
vq(A) = Va(A) = ug(A). 

Proof Let I, be the indicator function of A. If C; is a constituent cell in 
a dissection D, then Q([4,C;) = 0 if Cj € J(A) UL(A), and Q(4,C;) = 1 
if C; € K(A). Thus 


YS) va(C)Q(L4,0;)= S> va(Cy), 


CeD C)EK(A) 


so that A is Jordan measurable if and only if given € > O there exists a 
dissection D of C such that )){vg(Cj) : Cj € K(A)} < €. Similarly 0g(A) is 
the upper integral of 4, and v,(A) is the lower integral of I4; the remaining 
results follow from this. 


Corollary 18.5.2 A is Jordan measurable if (and only if) for each « > 0 
there are Jordan measurable subsets By and Bz of C, with By C A C Ba, 
such that vq( Bo) < va(Bi) +. 


The quantities Jy(A) and v,(A) are called the outer and inner Jordan 
contents of A. 

Thus a bounded set is Jordan measurable if it can be approximated from 
the outside and the inside by finite unions of cells, whose contents converge 
to a common value. This notion of approximating content, in two or three 
dimensions, by considering simple figures, goes back to the ancient Greeks. 
(The transition to Lebesgue measure is made by approximating by open sets 
(on the outside) and compact sets (on the inside).) 

Since the sum and product of two real-valued Riemann integrable func- 
tions are Riemann integrable, it follows that the intersection and union of 
two Jordan measurable sets A and B are Jordan measurable, and that 


ug(A U B) + ug(A is B) = uq(A) + Wal Bs 


Similarly A\ B is Jordan measurable, and vg(A\ B) = vg(A) —vaq(AN B). It 
is easy to see that if A is Jordan measurable, then so is a translate A+ a = 
{a+z:a€ A}, and that vg(A +x) = vq(A). Similarly, if A > 0 then the 
dilate XA = {Aa: a € A} is Jordan measurable, and vg(AA) = A¢vg(A). 

The notion of Jordan content makes the idea of the integral as the ‘area 
under the curve’ explicit. 


Theorem 18.5.3 Suppose that f is a bounded non-negative real-valued 
function defined on a compact cell C in R4. Let 


As={(@,y)€CxR:0<y< f(a)}. 
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Then f is Riemann integrable if and he if Af is a Jordan measurable subset 
of R¢ x R= R%"", If so, then fy f(x) de = vayi (As). 


Proof The result is certainly true if f is a step function. 

Suppose that f is Riemann integrable, and that « > : es exist step 
functions g and h with 0 < g < f <h such that [.(A(x) — g(a)) du < «. 
Then A, and Ap are Jordan measurable, and 


va41(Apn) = [ h(a) dx < [oe On 4-€= vaia(Ag) +e. 


Since A, C Ay C Ap, it follows that Ay is Jordan measurable. Since 
vq41(Ag) S Vay (Ap) S oo )and f, g(a) dx < Jo f(x) dx < JQ h(x) dz, 
it also follows that {., f(x) da = vg41(Ap). 

The converse is — in a similar way. Suppose that D is a dissection 
of C x [0, M]. If x € C, let I, = {(2,t) :0<t < M}, and let 


Jn(Ap) = {Cy € J(Az) : Ch In £9}, 
K,,(Ap) = {Cj € K(Aps) : Cp OI F OF. 
Let mp(x) = 0 if Jz(Ayr) = 9, and let 
mp(«) = sup{t : (x,t) € Cj,C; € Jn(Af)} otherwise; 
similarly, let Mp(x) = 0 if Je(Ay) U K2(Af) = 9, and let 
Mp(2) = sup{t : (x,t) € Cj,Cj € Je(Ap) U Kx (As)} otherwise. 


Then mp < f < Mp. It ae that if the condition is satisfied then f is 
Riemann integrable and f., f(x) dx = va41(Af). 


A similar result holds if we consider the set 
U;={(z,y) €CxR:0<y< f(x}. 


Let us show that there is a useful class of Jordan measurable sets. A 
convex body in R4 is a convex subset with a non-empty interior. 


Proposition 18.5.4 A bounded convex body A in R®% is Jordan measur- 
able. 


Proof We need the following easy result about convex sets. 


Lemma 18.5.5 Jf A is a convex subset of a vector space V and X > 0 
then (1+A)A=A+A)A. 
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Proof If A is any subset of V, 


(L+A)A= {a+ Aa:a€ A} C {a+ Ab: a,b€ A} =A+tAA; 


we need to establish the converse inclusion, when A is convex. If « = a+ Ab € 
A+ AA then 


,2 
1 


A 
aye 


a a 


and soz = (1+A)ye (14+A)A. 


Let us now prove Proposition 18.5.4. Without loss of generality, we can 
suppose that 0 is an interior point of A, so that there exists 7 > 0 such that 
N, (0) C A. Suppose that C' is a compact cell containing A and that € > 0. 
Let 6 = en. Then 


A+N;(0) CA+e€A=(1+6)A, 


so that if « ¢ (1+ .)A then d(z,A) > 6. Thus if D is a dissection of C 
with mesh size less than 6, and D = J(A) U K(A) U L(A), as above, then 
Uc,ex(ayC; S (1 + €)A. Consequently 


Uq(A) < vg((1 + €)A) = (1 + €)4ug(A). 


Since ¢€ is arbitrary, Jq(A) = u,(A), and A is Jordan measurable. 


Exercises 


18.5.1 Suppose that f and g are Riemann integrable functions defined on 
a compact cell C’ in R%, taking values in [a,b], and that f < g. 
Let A = {(2,y) € Cx [a,b] : f(x) < y < g(x)}. Suppose that h 
is a continuous function on C x [a,b]. Show that hl, is Riemann 
integrable and that 


[ seoaw = f ( / “ (cv) i) on 


Does a similar result hold for Riemann integrable functions h? 
18.5.2 Show that a bounded subset A of R@ is Jordan measurable if and 
only if its boundary OA has outer content 0. 
18.5.3 Suppose that f and g are Riemann integrable real-valued functions 
on a cell C in R4, and that g is non-negative. Show that there exists 
c ER, with 


inf{ f(z): 2 E€C}<c<sup{f(z):ceEC}, 
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such that f, f(a)g(x) dx = c f.. g(a) dx. 

18.5.4 Suppose that C is a compact convex body in R%. Show that C is 
homeomorphic to the closed unit ball in R?. 

18.5.5 Use induction, and repeated integrals, to calculate the volume of the 
unit ball By = {z € R@: |x|, < 1} in R¢. How does vq(Ba) behave 
as d + oo? 

18.5.6 Let Eq = {x € Ba: |x1| < 1/Vd} be an equatorial strip in By. Show 
that vq(Ea)/va(Ba) converges to a non-zero limit as d > oo. 


18.6 Linear change of variables 


In this section and Section 18.8, we consider ‘change of variables’. We begin 
by establishing a ‘change of variables’ formula for linear mappings. The 
problem here is that the notion of a cell depends upon the coordinates 
in R?. 

We therefore need to appeal to some elementary plane geometry and to 
the structure of the general linear group GLg, the group of invertible lin- 
ear mappings of R?%. We consider some simple elements of GLg. A scaling 
operator D is an element of GLg defined by an invertible diagonal matrix 
diag(\1,...,Aq), so that D(x) = (A1%1,...,Aqvq). An elementary shear 
operator R is an element of GLg defined by an elementary shear matrix: 
a matrix T’ of the form J + akj;;, where i # j and Ej; is the matrix 
with (£;;);; = 1, and with all other entries zero. Thus if T(x) = y then 
Yi = 1%; +a0x,;, and y, = x for all other indices. 


Theorem 18.6.1 JfT € GLg then T can be written as PDQ, where D is 
a scaling operator, and each of P and Q is the product of a finite number of 
elementary shear operators. 


This theorem is proved in Appendix B (Theorem B.2.3). 


Proposition 18.6.2 Suppose that T is an invertible linear mapping of R4 
onto itself and that C is a cell in R¢. Then T(C) is Jordan measurable, and 
vg(T(C)) = | det T|.vg(C). 


Proof Since T(C) is a convex body, it is Jordan measurable. Since 
det(ST) = det S. det T it is sufficient, by Theorem 18.6.1, to prove the result 
for scaling operators and for elementary shear operators. 

If T = diag(A1,...,Aq) then T(C) is a cell and 


ug(T(C)) = |Ay eee Aq|-va(C) = | det T|0g(C), 
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Figure 18.6. A cell and a sheared cell. 


so that the result holds for scaling operators. It remains to show that the 
result also holds when T = J + aj; is an elementary shear operator. Since 
the translate of a cell is a cell with the same content, it is sufficient to 
consider the case where C = (0, 1}?. 

We consider the case where a > 0; the proof for a < 0 is similar. Since 
det T = 1, setting 


e, = D,e, +e; = E,e; =F, T(e, +e;) = E’ and T(e;) = F’, 
va(T(C)) = vo(ODE'F’) = v9(ODEF’) + vo(DE’E) 
= vo(ODEF’) + v2(OF’F) = vo(ODEF) = | det T|vg(C). 


Corollary 18.6.3 T~1(C) is Jordan measurable, and 


vg(C) = | det T|.vg(T~1(C)). 


Proof For det(T~*) = 1/ det T. 


Corollary 18.6.4 If A is a Jordan measurable subset of R4, then T(A) 
is a Jordan measurable subset of R¢, and vg(T(A)) = | det T|vq(A). 


Proof The result is true if A is the finite disjoint union of cells. If 
€ > O there exist two such sets B, and Bg with By C A C Bay, and 
with vq(Bo) < va(Bi) + €/|detT|. Then T(B,) C T(A) C T(B2) and 
va(T(Bo)) < va(T(B1)) + €. Hence T(A) is Jordan measurable, by Corollary 
18.5.2, and it follows that vg(T(A)) = | det T|vq(A). 
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Corollary 18.6.5 Jf f is a real-valued Riemann integrable function on 
T(A), then foT is a Riemann integrable function on A, and 


| fle) dx=|aet | f fer 
T(A) A 


Proof If (a,t) € R’ x R, let T(x,t) = (T(z), t). Then det T = det T, and 
so the result follows by applying Theorem 18.5.3. 


We shall extend these results to a non-linear change of variables in Section 
18.8. 


18.7 Integrating functions on Euclidean space 


The results of the previous section allow us to integrate functions defined 
on a subset of Euclidean space, when there is no coordinate system in place. 
We introduce coordinates, use them to define the integral, and then show 
that this does not depend on the choice of coordinates. 

Suppose then that FE is a d-dimensional Euclidean space, and that 
(e1,..-,€a) and (e},...,e/,) are two orthonormal bases for E. If x € R4, 
we set L(x) = 4 ie, aud. Le) = _ aje’,. L and L’ are linear isome- 
tries of R? onto E, U = L~! 0 L’ is an orthogonal mapping on R%, and 
[det U7] = 1. 

Suppose now that B is a bounded subset of EL. We say that B is Jordan 
measurable if L~1(B) is a Jordan measurable subset of R42, and set vg(B) = 
vg(L-1(B)). Since L-!(B) = U(L’~1(B)), it follows from Corollary 18.6.4 
that these definitions do not depend upon the choice of basis. Similarly, if 
f is a real-valued function on B, we say that f is Riemann integrable if 
fo A is i integrable on L~'(B), and define the Riemann integral 


fag B x) dug(« 


[ f@dute)=f  pu@)az, 
B L-1(B) 
Again these definitions do not depend upon the choice of basis, this time by 
Corollary 18.6.5. 

Finally, suppose that f is a function taking values in an e-dimensional 
Euclidean space F’. Suppose that (g1,...,ge) is an orthonormal basis for 
F. We can then write f(z) = yt fj(x)g;. We say that f is Riemann 


integrable if f1,..., fe are and we set 


[ teoravate =D ( fn POR INC (2)) 9 
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This time, it is the linearity of the integral that ensures that this does not 
depend upon the choice of basis. 


Exercise 


18.7.1 What happens if we use bases of EF which are not orthonormal? What 
happens if we use bases of F’ which are not orthonormal? 


18.8 Change of variables 


We now consider a more general change of variables. 

Suppose that C is a fat Cantor subset of [0,1]. If a € [0,1], let 
f(z) = 2-—Igw(x), so that f(z) = 1 if e € C© and f(x) = 2 other- 
wise. Let Ur = {(z,y) € (0,1) x (0,2) : 0 < y < f(x)}. Then Uy is an 
open subset of R? which is not Jordan measurable. Further U f is connected, 
and its complement is also connected. It therefore follows from the Rie- 
mann mapping theorem (which we shall prove in Volume III), that there 
is a smooth diffeomorphism ¢ of the open unit square (0,1) x (0,1) onto 
Uy. This clearly has bad consequences for ‘change of variables’ results. In 
fact, the bad behaviour results from bad behaviour of ¢ near the boundary 
of (0,1) x (0,1). If we avoid this possibility, then, as we shall see, we can 
obtain some positive results. 

First, we consider what happens to compact cells. 


Theorem 18.8.1 Suppose that 6: U > V is a diffeomorphism from an 
open subset U of R4 onto an open subset V of R“%, and that Doz is invertible, 
for each x € U. If C is a compact cell contained in U then ¢(C) is Jordan 
measurable, and va(¢(C)) = Jo |Je(x)| dx, where Jy is the Jacobian of ¢. 


Proof The idea of the proof is to find a fine enough dissection of C’ such 
that we can approximate @ linearly on the constituent cells, and to use the 
estimates in the Lipschitz inverse function theorem to obtain good approx- 
imations. Since we are working with cells, it is convenient to work with the 
supremum norm on R: ||z||,, = max{|z,;| : 1 < 7 < d}, and with the 
corresponding operator norm on L(R%). 

Since $(C) is a compact subset of the open set V, there exists 6 > 0 such 
that the compact set K = {y € R?: d(y, ¢(C)) < 6} is contained in V. 

Suppose that 0 < € < 1. Choose 0 < 7 < € such that 


l-e<(l—n)?< (1+)? <1t+e. 
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The mappings Jy and D¢@ are uniformly continuous on C’ and, by Corollary 
14.6.9, Do! is uniformly continuous on K. There therefore exist 1 <M < 
oo such that 


||D¢(x)|| < M, for 2 € C, and ||D¢~!(y)|| <M, for y € K, 
and 0 < 6 < @ such that 
|Jg(x) — Jg(z")| < n/va(C), for z,x’ € C, |x — || <6, 
|| Dd(x) — De(z')|| < n/2M, for x, 2’ € C,||a —2'|| < 6, 
and ||[D¢~'(y) — De! (y’)|| < n/2M, for y,y' € K,||y—y'|| < 6. 


Suppose now that D is a dissection with mesh size less than 6, with 
constituent cells C1,...,C,, with midpoints 7,..., 7%. Let S; = Déz,, for 
1<j<k. 

Let us consider a particular cell C;. By translating U and V, we can 
suppose that 2; = 0 and that ¢(2;) = 0. If h € C; let w(h) = S;*(o(h)). 
We show that 

(1—)Cj S (Cj) € (1 + 0)Cj. 
By Corollary 17.2.6, ifh,k € C;, then 

I|o(h) — o(k) — Deo(h — k)]| < ||h — kl] sup{|| Dd — Doo] 1 € [h, kl} 

< (n/M) ||h — kil. 


If h € Cj, let x(h) = p(h) — h. Then 


IIx(h) — x(F)I] = Ie) — ok) — (h— &)I 
<M ||o(h) — o(k) — Doo(h — k)|| < 4 || — ki. 


Thus x is a Lipschitz function on C;, with Lipschitz constant 7. It therefore 
follows from the Lipschitz inverse function theorem (Theorem 14.6.6) that 


(1—m)Cj C ¥(Cj) € A+ )Gj. 
Consequently, 
(1 — n)55(Cj) © G(Cj) C (1 + 9) 55 (C4). 


We use these inclusions to estimate the upper and lower Jordan contents of 
o(C). First, 


Val O(C5)) < val(1 + 9) Sj (Cj) < (1 + ©)[Jo(2;)|va(Cy), 
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and 
i, | Js(x)| da — |Jg(xj)|va(C5)| < eva(Cj)/va(C). 
Thus 
Ba(d(C)) < _ Ba(G(C;)) 
j=l 
k 
< S01 + Jp (25) val. pes lee | |Ja(x)| dx + 
j= 
Secondly, 
val (Cj) 2 va(Sj((1 — m)Cj)) = 1 — €)Jo(@s)|va(Cy), 
so that 
k 
= oul Cj)) > (1—) D> | Jp(zs)lva(Cy) 
j=l 


ain )| dx —e. 


en € is arbitrary, it follows that ¢(C) is Jordan measurable, and that 
va(o(C = fo |Jo(x)| da. 


This result can extended to more general sets. 


Corollary 18.8.2 Suppose that B is a Jordan measurable subset of U 
contained in a compact subset K of U. Then $(B) is Jordan measurable, 
and 


v((B)) = | slo)lae. 


Proof Let L = sup{|Jg(x)| : « € K}. First suppose that B is a cell. 
Suppose that € > 0. There exists a compact cell C’ contained in B such that 
va(C’) > va(B) — e/L = va(B) — €/L. Then 


v4 o(B)) — val 0(C) = [ olw)lae— ff \Igla)|ae <« 


since ¢ is arbitrary, ¢(B) is Jordan measurable, and vq(¢(B)) = Jp |Jg(x)|dz. 
In the general case, if € > 0 there exists a finite ee of disjoint cells 
J(B)U K(B) such that 


L{C:CeJ(B)} CBCU{C: Ce J(B)UK(B)}, 
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for which vg(U{C : C € K(B)}) < €/L. Then 
va(P(U{C : C € J(B)})) > va O(U{C : C € J(B)U K(B)})) —«, 


so that ¢(B) is Jordan measurable, and it follows that vg(¢(B)) = 
fp lJo(x)|de. 


We now obtain a change of variables result for Riemann integrable 
functions. 


Theorem 18.8.3 Suppose that d6: U > V is a diffeomorphism from an 
open subset U of R@ onto an open subset V of R¢, and that Dé, is invertible, 
for each x € U. Suppose that K is a compact subset of U, that B is a Jordan 
measurable subset of K and that f is a Riemann integrable mapping from 
@(B) into R®. Then f o@ is Riemann integrable, and 


lo ty )dy= ff F(ole 2))|Jg(x)| de. 


Proof We use Theorem 18.5.3. By considering the coordinates of f, we 
can suppose that f is real-valued, and by considering f* and f~, we can 
suppose that f is non-negative. Let M = ||f||,, +1, let 


U =U x (—M,M) and V = V x (-M,M), 


and let 6(2,t) = (¢(z), t) for (x,t) € U.Thend:U > Visa diffeomorphism 
from U onto V, and D¢(a,1)(h, 8) = (Dez(h), s), so that Dox is invertible, 
for each x« € U. Further, Jz (a, t) = Jg(z), and b(Afog) = = Ay;. Since f is 
Riemann integrable, the set Ay is Jordan measurable. Applying Corollary 
18.8.2 to the mapping o-}, it follows that Aysog is Jordan measurable. Thus 
fo@is Riemann integrable, and 


Flu)dy = vaslAg) =f [Jg(e.9|dle.? 


“Kr i) Joa jax = ff sole ))lJo(@)| de. 


In many cases, the integral can be extended to the whole of U, as the 
following example shows. 


o(B) 


Proposition 18.8.4  {)~ e?/2 dt = J5- 
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Proof We use polar coordinates. Let U = (0,00) x (0,7/2), and let 
o(r,8) = (rcos@,rsin@). Then ¢ is a continuously differentiable homeo- 
morphism of U onto the quadrant V = (0,00) x (0,00) and 


cos 0 sin 0 
—rsin@ rcosdé 


D679) = | | , so that Jy(r,0) =r. 
Set f(y1,y2) = e¥i/2e-¥2/? for y = (y1, ya) € V, so that f(o(r,0)) =e" /?. 


Let Ur = [e, R] x [e, 7/2 — €] forO<e€< 1/4 < R< oo. Then 


( il edt)? = lim f(y) dy 
0 e>0,R- 00 (Ue,R) 


ms pe 
= lm e” | 2” dr dO 
e—0,R- 00 Ue r 


lim (1/2 —26)(e7 © /? — e-F'/?) = wr /2. 


e—>0,R- 00 


Exercises 


18.8.1 Let f(x,y) = (1/2m)e@’+¥)/2, Let D, = {2,y) € R? : 2? + y? < 
r7} and let B, = {(x,y) € R?;x < sy}. Calculate 


y f(a,y) d(a,y), | f(a,y) dw, y) f(a,y) d(a,y). 
D:. Bs 


and 
D,OBs 


18.8.2 Let Q = {(z,y) € R? : « > 0,y > O}. Let f(z,y) = e 1), for 
(x,y) € Q. Let 


A, ={(a7,y)€Q:a+y<r}, B,={(x,y) € R?. 24 < sy}. 


Calculate 


| f(e,y) d(e,y), | f(a,y) dle, y) fa,y) dle, y). 
Ae Bs 


and 
A,NBs 


18.8.3 Explain (or find out) the probabilistic significance of these results. 
18.8.4 Let U = (0,00) x (0,7) x (—a, 7). If (7,0, 4) € U, let 


J;(r,0,¢) = (rsin 6 cos ¢,r sin @ sin ¢, r cos 0). 


Show that J, is a homeomorphism of U onto R? \ H, where H is 
the half-plane {(z,y,z) : x < 0,y = O}, and illustrate this with a 
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18.8.5 


18.8.6 


18.8.7 


18.8.8 


18.8.9 
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sketch. Show that the Jacobian of J, is 


(2, y 2) Yr?) =r’sing. 


O(r, 6, ’) 

The coordinates (r,@,¢) are the spherical polar coordinates; r is 
the radius, 0 is the inclination angle, and ¢ is the azimuth. 
Use spherical polar coordinates to calculate the volume of the set 
(described in spherical polar coordinates) {J,(r,0,¢) : r? < sin 6}. 
Let V = (0,00) x (—7,7) x R. If (p,¢,2z) € V let J-(p,0,z) = 
(pcos ¢, psin ¢, z). Show that J, is a homeomorphism of U onto R? \ 
H, where H is the half-plane {(z, y, z) : x < 0,y = 0}, and illustrate 
this with a sketch. Show that the Jacobian of J; is 

A(x, y, 2) _ 


O(p, >, 2) 


The coordinates (p, ¢, z) are the cylindrical polar coordinates; p is 
the radius (notice that this not the same as the radius in spherical 
polar coordinates), ¢ is the azimuth and z is the altitude. 

Use cylindrical polar coordinates to calculate the volume of the set 
(described in cylindrical polar coordinates) 


Je = {(p, 6,2) : p < 2008 6/2,0 < z < 1}. 
Let P = (—1/2,0) and Q = (1/2,0). Let 
8 = s(x,y) =d((x,y),P), t= t(a,y) = d((z,y), Q), 
t= Ws, 9) —s+1, 8 =) =o —4, 


for (x,y) € R?. Show that the mapping (x,y) > (u(2,y), v(a, y)) is 
a homeomorphism of the upper half space Ht = {(x,y) : y > 0} 
onto (1,00) x (0,1). Show that 

O(u,v) _—-2y 


O(z,y) st | 


—(s+t) 

e 

| ae, 
H+ st 


Let P = (0,0, —1/2) and Q = (0,0, 1/2). Let 


8 = s(x,y,2z) =d((,y, 2), P), t= tx, y, 2) = d((a,y, z),Q), 


U= ule“, 4,2) =s+t, v= 0(4¢, 4,2) =s—t, 


Calculate 
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for (x,y,z) € R®. Calculate 
(u,v, >) 
Oe, YZ) 


where ¢ is the azimuth. [Use cylindrical polar coordinates, and the 
chain rule.] Show that 


e7 (stt) F 5 
I. st (xy, 2) _ T /€. 


18.8.10 Prove a version of Theorem 18.8.3 for continuous functions taking 
values in a Banach space. 


18.9 Differentiation under the integral sign 


Suppose that A is a compact Jordan measurable subset of R@ and that (a, b) 
is an interval in R. Suppose that f is a function on A x (a,b) taking values 
in a Banach space (F’,||.||), that the mapping x > f(z,t) from A to F is 
Riemann integrable for each t € (a,b), and that the rg i Bi fed) 
from (a,b) to F is differentiable for each x € A. Let F(t) = {, f(a,t) dz. 
When is F a differentiable function of t? If it is, then Se is 


dF . fof 
a a 5 (f sennar) = f (enact 


In other words, when can we change the order of integration and differenti- 
ation? We give just one positive result, where we impose strong conditions 
on f; it is however suitable for many purposes. 


Theorem 18.9.1 Suppose that A is a compact Jordan measurable subset 
of R@ and that (a,b) is an interval in R. Suppose that f is a continuous 
function on A x (a,b) taking values in a Banach space (F,_||.||), and that 
the partial derivative (Of /Ot)(x,t) exists at al ei of A x (a,b) and is 
a continuous function on A x (a,b). Let F(t) = [, f(x,t)dx. Then F is a 
continuously differentiable function on (a, i, = 


- =5( | sear) = [ fevae. 


Proof We use Corollary 17.2.6. Suppose that a < t < 6 and that [t — 
n,t +7] C (a,b). Suppose that « > 0. Then Of /0t is uniformly continuous 
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on A x [t — n,t + y] and so there exists 0 < 6 < 7 such that if x € A, 
u,v € [t—n,t+7] and |u—v| <6 then 


\|(Of/0t)(x, u) — (Of /Ot) (x, v)|| < €/va(A). 
Ife Aandt—d<s<t+6 then by Corollary 17.2.6 


of 


we 


(x, u) = 


as 
at 


< |s — tle/vg(A). 


[Fes - ren -ohan 


<|s—t| sup 
u€[s,t] 


Integrating, 


sf |e) - F@.9- 6-920] ae sds, 
a Ot 
which shows that F is differentiable at t, with derivative |, (Of /Ot) (x,t) dx. 


The continuity of the derivative then follows from the uniform continuity of 
Of /Ot on A x [t —,t +7]. 


Exercises 


18.9.1 Suppose that f and g are continuous functions on R for which 
Sal f(a)| da < 00 and fp |g(x)| dx < oo. Let 


A(t) = " f(x)g(y) d(x, y), where A, = {(z,y) € R?: 2+y < th. 


Show that A is differentiable, and that H’(t) = fp, f(t — x)g(a) dz. 
18.9.2 Suppose that f and g are continuous sees aie on R of 
compact support. Let 


= [ f(x)g(y) d(x,y), where B, = {(2,y) € R?: x < sy}. 


Show that K is differentiable, that K'(s) = fp af(sx)g(x) da. 
18.9.3 Suppose that f(x,y) = (1/me"@' +¥)/2, - that 


= | f(x,y) d(z,y), where B, = {(z,y) € R?:2>0,y < sz}. 
Bs 


Show that F is differentiable, and that F’(s) = 7/(1+ 87). 
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Differential manifolds in Euclidean space 


19.1 Differential manifolds in Euclidean space 


A manifold is a topological space which is locally like Euclidean space: each 
point has an open neighbourhood which is homeomorphic to an open subset 
of a Euclidean space. A differential manifold is one for which the home- 
omorphisms can be taken to be diffeomorphisms. We consider differential 
manifolds which are subspaces of Euclidean space. 

Recall that a diffeomorphism f of an open subset W of a Euclidean space 
E onto a subset f(W) of a Euclidean space F' is a bijection of W onto 
f(W) which is continuously differentiable, and has the property that the 
derivative Df, is invertible, for each x € W. If so, then f(W) is open in 
F, and the mapping f~! : f(W) — W is also a diffeomorphism. Further 
dim £ = dimF, and Df, has rank dim E, for each « € E. We split this 
definition into two parts. 

First, suppose that W is an open subset of a Euclidean space Eg of dimen- 
sion d, and that j is a continuously differentiable injective mapping of W onto 
a subset j(W) of a Euclidean space Fa,,, of dimension d+ n, where n > 0. 
Then j is an immersion if the rank of Dj, is equal to d, for each x € W; 
that is, if Dj, is an injective linear mapping of FE into F’,, for each x € W. If 
j is k-times continuously differentiable, we say that j is a C\*)-immersion, 
and if 7 is a smooth mapping, we say that 7 is a smooth immersion. 

Secondly, suppose that U is an open subset of a Euclidean space Eg4y, of 
dimension d+ n, where n > 0, and that g is a continuously differentiable 
mapping of U onto a subset g(U) of a Euclidean space F' of dimension n. 
Then g is a submersion of rank n if the rank of Dg, is n, for all « © U. Thus 
Dg, is surjective, and the null-space of Dg, has dimension d, for all « € U. If 
g is k-times continuously differentiable, we say that g is a C\*)-submersion, 
and if g is a smooth mapping, we say that g is a smooth submersion. 
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We use submersions to define the notion of a d-dimensional manifold 
M which is a subspace of a Euclidean space Fg,, of dimension d+n. A 
non-empty subset M of Egiy is a d-dimensional differential manifold if for 
each x in M there exists an open neighbourhood U, of x in Egi,y, and a 
submersion g, : U,;, + F of rank n such that 


MinU,= {y € Us: 9.(y) =0} 


locally, M is the null set of a submersion. As we shall see, the fact that the 
definition is a local one influences the way in which we establish properties 
of M. The manifold M is a C\)-manifold if each gz can be taken to be a 
C*)-submersion, and is a smooth manifold if each g, can be taken to be 
smooth. Differential manifolds are frequently called differentiable manifolds. 

If d+n= 3 and n= 1, then M is a two-dimensional surface in a three- 
dimensional Euclidean space. More generally, if n = 1, then M is called a 
hypersurface in Eq+1. In this case, we simply require that Dg, 4 0 for each 
xeu. 

Here are three easy, but important, examples. 


Example 19.1.1 The unit sphere. 


Let Ea41 be a (d+1)-dimensional Euclidean space, and let U = Eq41\{0}. 
If « € U, let g(x) = ||a||? — 1. Then Dg,(h) = 2(x,h), so that Dg, 4 0 for 
x € U. Thus g is a submersion, and the unit sphere 


54 = {x EU: g(x) =0} = {e € Bays: lle] =} 


is a d-dimensional differential manifold in Ey,1. Since g is smooth, S® is a 
smooth hypersurface in Fy41. 


Example 19.1.2 The graph of a differentiable function. 


Suppose that f is a continuously differentiable function defined on an 
open subset U of a d-dimensional Euclidean space Eg, taking values in an 
n-dimensional Euclidean space E,,. Let g(x,y) = f(x)—y, for (z,y) € UX Ey. 
Then Dg(z,y) = (Dfx, —1), so that Dg(z,y) is a linear mapping from Eg x Ey, 
into E,, with rank n. Thus 


Gy = {(a, f(x)): © € US = {(@,y) : g(@,y) = OF 


is a d-dimensional differential manifold in U x Ep. 

For the next example, we need the notion of a self-adjoint operator. 
Suppose that E is a d-dimensional Euclidean space and that T € L(£). 
Recall that the transpose T’ of T is the unique element of L(/) for which 
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(T(x), y) = (x,T"(y)), for all x,y € E. T is self-adjoint if T = T’. For exam- 
ple, if P is an orthogonal projection of E onto a linear subspace F’, and if 
x,y € E then 


(P(a),y) = (P(@), P(y)) = (@, P(y)), 


so that P is self-adjoint. If (e1,...,eq) is an orthonormal basis for E', and 
if T is represented by the matrix (t,;), then T is self-adjoint if and only if 
ti; = tj; for 1 < i,j < d. Hence the set Lq of self-adjoint linear operators 
on £ is a linear subspace of L(£) of dimension d(d + 1)/2. 


Example 19.1.3 The orthogonal group and special orthogonal group. 


Suppose that E is a d-dimensional Euclidean space, and that U = GL(E) 
is the group of invertible elements of L(£). U is an open subset of the 
d?-dimensional vector space L(E). Let g(A) = A’ A—I, for A € U. Then g(A) 
is self-adjoint, g is a smooth mapping from U into Lsq(E) and Dga(T) = 
A'T +T'A. Suppose that S$ € Lsq(E). Let T = 5A’~'S. Then 

Dga(T) = 5A'A' 18 + 4SATA=S, 
so that Dga is a surjective linear mapping from L(F) onto Lsq(£). Thus 
the orthogonal group O(£) = {A € U : g(A) = 0} is a smooth manifold of 
dimension d(d — 1)/2. O(£) is not connected; if gt is the restriction of g to 
the open subset U* = {A € GL(E) : det A > 0} of L(E£), then the special 
orthogonal group SO(E) is equal to {A € Ut : gt(A) = 0}, and is also a 
smooth connected manifold. 


Exercises 


19.1.1 Suppose that F is a differential manifold. Let E, = E x R, and let 
Eo = E x {0}. Ifa > 0, let fa(x) = ||z||° sin(1/ ||a||), for 2 € EB, and 
let Ga be the graph of fa. 

(a) Is Gg a differential manifold in E,? 

(b) Is G3 a differential manifold in F,? 

(c) Is G3 Eo a differential manifold in Eo? 

(d) Is G3 7 (Eo \ {0}) a differential manifold in Ep \ {0}? 

19.1.2 Which of the following are manifolds in R?? 

(a) {(a,y) + y? =a? — o*}. 
(b) {(sint, sin 2t) :0<t< 7}. 

19.1.3 A real-valued function f on R¢ is m-homogeneous if f(tx) = t” f (x) 
for all x € R¢@ and all t > 0. Suppose that f is continuously dif- 
ferentiable and m-homogeneous and that c € R \ {0}. Show that if 
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{x © R¢: f(x) =c} is not empty then it is a differential manifold in 
R?. What is its dimension? 

19.1.4 If E is a Euclidean group then the special linear group SL(E) is the 
set {T € GL(E) : detT = 1}. Show that SL(F) is a manifold in 
L(E). What is its dimension? 


19.2 Tangent vectors 


A curve in a Euclidean space F is a continuously differentiable mapping 6 
from an interval J in R into E (with a one-sided derivative at an end-point of 
I), and its track [6] is the image of 6. I does not need to be a closed interval, 
but if J is a closed interval [a,b], then 6 is a continuously differentiable path 
from 6(a) to 6(b) (and we call 6 a curve from d(a) to 6(b)). A curve is simple 
if 6 is injective. If 6 is a curve which is a simple closed path, we call 6 a 
simple closed curve. A curve 6: I > E is steady if ||0’(t)|| = 1, for each 
bed: 

Suppose that M is a differential manifold in a Euclidean space E, and 
that « € M. A vector h in F is a tangent vector to M at x if there exists 
an open interval (—d,6) in R and a curve w, : (—6,6) > E taking values in 
M, such that w,(0) = x and y/,(0) =h. Let 0,(t) = x + th, for t € (—6,6). 
Then h is a tangent vector to M at x if and only if ||w,(t) — 6.(t)|| = o(|t|). 

This definition does not involve submersions. The set of tangent vectors 
to M at x can be characterized in terms of submersions. 


Theorem 19.2.1 Suppose that M is ad-dimensional differential manifold 
in a Euclidean space E, that x € M, that U, is a neighbourhood of x in E 
and that g: U, > F is a submersion for which 


MU, = {y € Uz : g(y) = 0}. 
Ifh € E then h is a tangent vector to M at x if and only if Dg,(h) = 0. 


Proof Suppose first that h is a tangent vector to M at xz, and that W: 
(—d,6) — M satisfies the conditions of the definition. If t € (—6,4), then 
g(w(t)) = 0 for t € (—d,6), and so, using the chain rule, 


Dgx(h) = Dgx(v"(0)) = (g 0 b)'(0) = 0. 


The converse is harder to prove. Let us set T, = {h € E':: Dg,(h) = 0}: 
T, is the null-space of Dg,. Let P, be the orthogonal projection of EF onto 
T,. Let Qz = I— P,, and let Nz = Qz(F); Nz is the orthogonal complement 
of T,. If y € Nz, and Dg,z(y) = 0 then y € T, N Nz = {0}, so that y = 0. 
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Thus the restriction of Dg, to Nz is an injective linear map of N, into F. 
Since dim N, = rank(Dg,) = dim F, the restriction of Dg, to N; is a linear 
isomorphism of N, onto F’. 

We now define a mapping g from U, to T, x F by setting g(y) = 
(P.(y — x), g(y)). The mapping g is continuously differentiable, and Dg, = 
(P,, Dgx). Suppose that (h,k) € T, x F’. Since g is a submersion, there exists 
y € E such that Dg,(y) =k. Let z = Qz(y) +h. Since h — P,(y) € Ta, 


DGx(2) = (Pe(h), Dgx(y + (h — Pe(y)))) = (A, Dox (y)) = (h, k). 


Thus Dg, is a linear isomorphism of F onto T;, x F’. Since g is continuously 
differentiable, we can suppose, by replacing U, by a smaller neighbourhood 
if necessary, that Dg is invertible at each point of U,. Applying Corollary 
17.4.3, it follows that g is a diffeomorphism of U, onto the open subset g(Uz) 
of T, x F. 

Let © : g(U,) — U, be the inverse mapping. If h € T,, there exists 
6 > 0 such that (th,0) € g(U,z) for |t| < 6. Let w(t) = U(th,0). Then 
~(0) = x. Since g(w(t)) = (th,0), it follows that g(t) = = 0, so that 
w(t) € M. Further, ~'(0) = DVo)(h,0) = (Dgz)~1(h, 0). Since h € Ty 
and Dg,(h) = 0, it follows that Dg,(h) = (P:(h), Dgx(h)) = (h,0). Thus 
w'(0) = 


Corollary 19.2.2 The set T, of tangent vectors to M at x is a d- 
dimensional linear subspace of E, which depends neither on the choices of 
w in the definition of tangent vector, nor on the choice of the submersion g 
used to define M in a neighbourhood of x. 


The vector space 7, is called the tangent space of M at x, and the space 
N, is called the normal space of M at x. Continuing with the notation of the 
theorem, if y¢ MNU,, let d(y) = oc(y) = Px(y—2). Then g(y) = (¢(y), 9), 
so that ¢ is a homeomorphism of JMU, onto an open subset of the tangent 
space T,. The pair (Uz,¢) is called a chart of M near «. If (e1,...,€g) 
is an orthonormal basis for T,, and ¢(y) = ¢1(y)e1 + --- + da(y)ea, then 
(¢1,.--, a) are local coordinates for M in U,, or a parametrization of MNU,. 
The inverse mapping w = ¢,' : (Uz) 4 Uz is an immersion. 

For example, if M = S"-! = {x € E: g(x) = |lz||? = 1} is the unit 
sphere in an n-dimensional space and if « € M then Dg,(h) = 2(x,h), and 
so T, = x+, the space of vectors orthogonal to 2, and N, = span (z). 


Corollary 19.2.3 Suppose that V is an open subset of a Euclidean space 
F, that f : V > E is a continuously differentiable mapping and that f(V) CG 
M. Then Dfx(F) © Tq) for each x € F. 
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Proof Suppose that Uy) is an open neighbourhood of f(z) and that 
g : Us(z) — G is a submersion for which 


M NU ya) = {y € Us(a) : gly) = OF. 


Let Vz = f~'(Ugia)). Then g(f(y)) = 0 for y € Vz, and so D(go f)z = 
Dg F(z) ° Dfx = 0. Thus Dfz(F) c Tha): 


Corollary 19.2.4 Jf e« > 0 there exists 6 > 0 such that ify © M and 
lly — x|| <4, then ||(y — x) — de(y)|l < €|léz(y)l| < Elly — 2I- 

Proof The theorem shows that Dyo is the identity mapping on T,. If 
v € oz (U;), let 0(v) = v(v) — v. Then 6 maps ¢,(U,) into E, 0(0) = x and 


D069 = 0. Since D@ is continuous at 0, there exists 6 > 0 such that if ||u|| < 6 
then ||D6,|| < €/2. If |ly — || < 6 then 


lox(y) I] < Pell - lly — el] = lly — 2I] < 6, 
so that 


I(y — 2) — O(y)I = Ib(Ge(y)) — b2(y) — all 
= ||0(42(y)) — AO) II < €lloW)IL- 


by the mean value inequality. 


Corollary 19.2.5 Jf «> 0 there exists 6 > 0 such that ify © M and 
lly — a] <6, then ||ee(y)Il < lly — all < 1 + €) Ilee(y)I]- 


Proof For 


Ilge(y)Il < lly - ell < Iw — a — ge(y)) + b2(y) 
S |ly— 2 — de(y)Il + loc) Il < CL + ©) [Ibe (y) 


Theorem 19.2.6 Suppose that M is ad-dimensional differential manifold 
in a Euclidean space E, and that (Uz,¢z) and (Uy,¢y) are two charts. If 
Uz, Uy #0 then dy bz! : d2(Uz Uy) 4 by(Uz Uy) is a diffeomorphism, 
which is a C)-diffeomorphism if M is a C\*)-differential manifold, and is 
smooth if M is a smooth manifold. 
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Proof This follows directly from the corresponding properties of the 


mappings g and g~' defined in the preceding theorem. 


These results lead to the notion of an abstract differential manifold. This 
is a Hausdorff topological space (M,7) with the property that there is a set 
C = {(U,¢)} of charts, where 


e the sets U are open subsets of M which cover M, 

e if (U,¢) EC, then ¢ is a homeomorphism of U onto an open subset ¢(U) 
of R¢, 

e if (U,d) and (V,7~) are charts and UNV ¢ 0 then the restriction of od7! 
to (UNV) is a diffeomorphism of 6(U NV) onto V(UNV). 


Such a manifold M has a much weaker structure than a differential manifold 
which is a subspace of a Euclidean space (for example, there is no natural 
metric on M), but the study of these manifolds is the concern of differen- 
tial geometry.! We restrict our attention to differential manifolds which are 
subspaces of Euclidean spaces. 


Example 19.2.7 The tangent bundle of a C(?)-differential manifold in a 
Euclidean space. 


Suppose that M is a d-dimensional C(?)-differential manifold in a 
Euclidean space E. We define the tangent bundle T(M) of M to be the 
subset {(z,v) : « € M,v € T,} of E x E. Let us show that T(M) is a 
2d-dimensional differential manifold in £ x E. Suppose that 7 € M, that U, 
is an open neighhbourhood of x in E and that g : U, > F is a submersion 
for which MU, = {y € U; : g(y) = 0}. Define G: U, x E> F x F by 
setting G(y,v) = (9(y), Dgy(v)). Then 


T(M) 0 (Ur x E) = {(y,v) : G((y, v)) = Of. 


We must show that G is a submersion. G is continuously differentiable, and 
the matrix of partial derivatives is 


Dg, 0 
D?gy(v,-) Dy 


Since Dg, has rank d, DGi,,,) has rank 2d, and so G is a submersion. 
We have used submersions to define differential manifolds in a Euclidean 
space. We can also use immersions to do this. 


1 For a good introduction, see Dennis Barden and Charles Thomas, An Introduction to Differential 
Manifolds, Imperial College Press, 2003. 
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Theorem 19.2.8 Suppose that M is a subset of an n-dimensional 
Euclidean space E with the property that for each « € M there is an open 
neighbourhood U of x in E and an immersion j from an open subset V 
of a d-dimensional Euclidean space F’, with j(V) = MNU. Then M is a 
d-dimensional differential manifold in E. 


Proof Suppose that x € M. Let z = j~'(x) and let K be the orthogonal 
complement in EF of Dj,(F’): K is an (n — d)-dimensional subspace of E, so 
that dim (F' x K) =n. Define 7: V x K > E by setting 7(y, w) = j(y) + w. 
Then 7 is continuously differentiable, and D7 = (Dj, J), where J: K > Eis 
the inclusion mapping. Then rank(D7,) = n, and so, by the inverse mapping 
theorem, there is a neighbourhood W of x contained in U such that 7 is a 
diffeomorphism of 7~'(W) onto W. Let g = (f,g) be the inverse mapping. 
Then g is a submersion of V into W. Further, g(y) = 0 if and only if g(y) = 
(f(y),0), which happens if and only if y = j(g(y)) = j(f(y)) © MAW. 
Thus / is a differential manifold. 


If the immersions are C\)-immersions, then M is a C*) differential 
manifold, and if the immersions are smooth then M is a smooth manifold. 


Exercise 


19.2.1 Suppose that M is a d-dimensional C(?)-differential manifold in a 
Euclidean space FE. The unit sphere bundle S(M) is defined to be 
S(M) = {(a2,v) € T(M) : |u|] = 1}. Show that S(M) is a 2d-1 
differential manifold in E x E. 


19.3 One-dimensional differential manifolds 


The simplest examples of differential manifolds are the one-dimensional ones. 
In the next theorem, we classify the connected one-dimensional manifolds 
contained in a Euclidean space. The results are hardly unexpected, but the 
proofs require some care, and illustrate the use of the results concerning 
paths that have been established earlier. 


Theorem 19.3.1 Suppose that M is a connected one-dimensional mani- 
fold in an open subset U of an open subset U of a Euclidean space E. There 
are two possibilities. 

First, M is a compact subset of U and there exists a steady simple closed 
curve y in U such that M = [4], the track of y. 
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Secondly, M is not a compact subset of U, and there exists an open inter- 
val I in R and a steady simple curve 8: I + U such that M = [6], so that 
B is a homeomorphism of I onto M. 


Proof We prove a series of lemmas. 


Lemma 19.3.2 Suppose thatx € M. There exists an open neighbourhood 
W, of x in E and a chart yz: W,NM — T, such that y2(W, MM) is an 
interval (—h,h) in T,. If y © (Wz AM) \ {x} there exists a unique simple 
steady path 8 : (0, L] + M from = to y. 


Proof There exist a neighbourhood U, of x in E and a chart @, :U,1M 
to the one-dimensional tangent space T,. Since ¢,(U; M M) is open in T;, 
there exists an interval (—h,h) C ¢2(Uz MM). Let W, = ¢;1(—h,h) and 
let yz be the restriction of ¢, to W,M M. Let y(t) = tdz(y) for O<t <1. 
Then x,' 07 is a simple curve in M from x to y; let 6 be its path-length 
parametrization. By Corollary 17.2.10, ||S’(¢)|| = 1; thus 6 is a steady curve. 
It follows from Corollary 17.2.11 that 6 is unique. 


We have the following uniqueness result. 


Lemma 19.3.3 Suppose that 3, : [0,L] > M and £2: [0,L] — M are 
two steady curves in M for which 6,(0) = 82(0) and B{(0) = 84(0). Then 
Bi = Pe. 


Proof Let 
G = {t € [0, L] : Bi(s) = Go(s) and 6}(s) = 64(s) for 0 <s < #}. 


Since the functions under consideration are continuous, G is a closed interval 
(0, g] in [0, Z]. It follows easily from Lemma 19.3.2 that G is also an open 
subset of [0, Z]; since [0, LZ] is connected, G = (0, L]. 


We now define a relation ~ on M by setting x ~ y if there exists a steady 
curve 6: I — M for which z, y € [6]. 


Lemma 19.3.4 The relation ~ is an equivalence relation. 


Proof Clearly x ~ x, and x ~ y if and only if y ~ x. Suppose that 7 ~ y 
and y ~ z, so that there exist a steady curve 6, : [0,Z,] + M from x to 
y, and a steady curve d2 : [0,L2] + M from y to z. If 65(0) = 6{(L1) then 
6, V 62 is a steady curve in M from x to z. Otherwise, 65(0) = —d{(L1). 
Suppose that Ly > La. Let d[ : [0,L2] + M be the reversal of d2. Then 
it follows from Lemma 19.3.3 that 7 (t) = 62(t) for 0 < t < Ly. Thus 
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z = o7 (Le) = 61(L1 — Le) € [61], so that x ~ z. A similar argument shows 
that if Ly < Lg then x € [69], so that x ~ z. 


Lemma 19.3.5 Ifz,yeM thenxu~y. 


Proof It follows from Lemma 19.3.2 that each equivalence class is open in 
M. Since M is connected, M is the unique equivalence class. 


We now complete the proof of the theorem. Choose 7g € M and choose 
a unit tangent vector ho in T;,,. Let 


I={t €R: there exists a steady curve 6: [0,t] > M@ 
with 6(0) = xo and 6’(0) = ho}. 


(Here we allow t to be negative, or zero.) It follows from Lemma 19.3.2 that 
I is an open interval, from Lemma 19.3.3 that there exists a steady curve 
6: I —> M with 6(0) = xo and 6’(0) = Ao and from Lemma 19.3.5 that 
[6] = M. 

There are now two possibilities. First, 6 is a simple curve. In this case, the 
second possibility holds. Secondly, there exist s,t € J, with s < t such that 
6(s) = 6(t). It then follows from Lemma 19.3.3 that if s+u,t+u€J then 
6(s+u) = 6(t+u). Thus 6 is periodic, and J = R. Let to be the fundamental 
period of 6: the least positive number for which 4(to) = 6(to +t) for allte R 
(see Volume I, Section 6.3, Exercise 1). Then 6 : [0,to] > M is a steady 
closed curve in M, with [6] = M. Finally, 6 : [0,t 9] > M is a simple closed 
curve, for if there exist 0 < s <t < to with t— s < to for which 5(s) = d(#), 
then it follows from Lemma 19.3.3 that t — s is a period of 6, contradicting 


the minimality of to. 


The leftmost trefoil in Figure 19.3a represents the track of a closed curve 
in the plane. It is not a manifold, since the curve intersects itself. The other 
two trefoils represent the track of curves in a three-dimensional space F. 
They are both manifolds, and are diffeomorphic to each other. On the other 
hand, there is no homeomorphism of F onto itself, mapping one manifold 


OQ 


Figure 19.3a. Three trefoils. 
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rae, 


Figure 19.3b. The Klein bottle. 


onto the other, since one curve is knotted and the other is not (we shall not 
prove this). 

Similar phenomena happen in higher dimensions. The Klein bottle illus- 
trated in Figure 19.3b is not a manifold in three-dimensional space EF, since 
it is self-intersecting. In fact, there is no manifold in a three-dimensional 
space which is homeomorphic to it, since it is a one-sided surface. On the 
other hand, it can be represented as a manifold in a four-dimensional space: 
take the fourth dimension to be time, start at time 0 at the circle of self- 
intersection, and proceed outwards in both directions, reaching the circle of 
intersection again at time 1. 


Exercise 


19.3.1 Give an example of a one-dimensional manifold in the plane with 
infinitely many connected components. Can there be uncountably 
many connected components? 


19.4 Lagrange multipliers 


We begin with a result which corresponds to Rolle’s theorem. We need some 
definitions. 

Suppose that f is a real-valued function on a topological space (X,7), 
and that « € X. Then f has a local maximum (strict local maximum) if 
there is a neighbourhood V of x for which f(y) < f(x) (f(y) < f(x)) for 
y € V \ {x}. Local minima and strict local minima are defined similarly. 

Suppose that f is a differentiable function on an open subset U of a 
normed space (£,||.||) and that « € U. Then z is a stationary point of f if 
Di =. 
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Proposition 19.4.1 Suppose that f is a differentiable function on an open 
subset of a normed space (E,||.||) and that f has a local maximum or local 
minimum at x. Then x is a stationary point of f. 


Proof Suppose that f has a local maximum at x, and that V is a neigh- 
bourhood of x for which f(y) < f(x) for y € V. If h € E, there exists 6 > 0 
such that «+ th € V for |t] < 6. Then 


so that Df,(h) = 0. This holds for all h € E, and so Df, = 0. The proof 


for a local minimum is exactly similar. 


It is important to note that the converse is not true. For example the 
function f(x) = «° on R has a stationary point at 0, but is strictly mono- 
tonic, and the function f(x,y) = xy on R? has a stationary point at (0,0), 
but takes positive, negative and zero values in any neighbourhood of (0,0). 

Suppose now that M is a differential manifold in an open subset U of 
a Euclidean space £, and that f is a continuously differentiable function 
on U. We consider the restriction of f to M. Suppose that « € M. Then 
f has a constrained local maximum (constrained strict local maximum) if 
there is a neighbourhood V of «x for which f(y) < f(x)(f(y) < f(x)) for 
y € MO (V \ {a}). Constrained local minima and constrained strict local 
minima are defined similarly. 

Recall that we use the gradient Vf; to describe the derivative of f at 
xz: Vfe € E and Df,(h) = (h,V fz). Now E is the orthogonal direct sum 
E=T,®WN,, where T, is the tangent space at x and N, is the normal space 
at x. We write 


Vie =Vrfea+Vnfe, with Vrfe € Ty and Vin fe € Nz : 


Vrfx is the tangential gradient and Vn fz is the normal gradient of f at x. 
We say that f has a constrained stationary point at x if Vrfz = 0. 


Proposition 19.4.2 Suppose that M is a differential manifold in an open 
subset U of a Euclidean space E, that f is a continuously differentiable 
function on U and that x © M. If f has a constrained local maximum or 
minimum at x then f has a constrained stationary point at x. 
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Proof Suppose that h € T,,. Then there exists 6 > 0 and a curve wz, : 
(—6,6) > M such that w,(0) = a and (0) = h. Then f ow, has a local 


maximum or minimum at 0, so that, by the chain rule, 


(h, Vrfe) _ (h, V fe) = Dfzx(h) — Dfx(v'(0)) = D(f ox) =0. 


Of course, Vy fz need not vanish. Indeed, if Uz, is a neighbourhood of x 
in U and g: U, > F is a submersion for which 


MU, = {fy € Uz: g(y) = 0}, 


then trivially every point y of MMU, is both a constrained local maxi- 
mum and a constrained local minimum of g, and is therefore a constrained 
stationary point of g, while V yy is a linear isomorphism of N, onto Dg,(£). 


Theorem 19.4.3 Suppose that M is a differential manifold in an open 
subset U of a Euclidean space E, that f is a continuously differentiable real- 
valued function on U and that x © M. Suppose that Uz is a neighbourhood 
of x inU and g: U, — F is a submersion for which MN U;z = {y € Uz : 
g(y) = O}. If f has a constrained local maximum or minimum at x, then 
there exists a unique o in F' for which V(f — (¢,g))z = 0. 


Proof Since Dg, is a linear isomorphism of N, onto F’, (Dg/,) is a linear 
isomorphism of F onto N,. Thus there exists a unique @ € F such that 
(Dg!.(¢)) = Vn fe, and so 


(h, Vn fz) = (6, Dgz(h)) for all h € Ny. 
But if h € N,z then 
(¢, Dgx(h)) = D((o, 9) )a(h) = (h, Vn ((G, 9) Je) » 


and so Vn(f —(%,9))2 = 0. Since Vr(f — (4, 9) )e = Vrfa—Vr((¢,9) a = 0 
it follows that V(f — (¢,9))x = 0. 
Finally, the proof shows that ¢ is unique. 


Corollary 19.4.4 If F =R¥*, andg = (g1,..., 9), then there exist unique 
Aq,---;Azk in R such that V(f — ae Aggy) = 0. 


The quantities \1,...,Az are called Lagrange multipliers. Lagrangian mul- 
tipliers provide a powerful tool for finding constrained local maxima and 
minima, as the following examples and Exercises 6 and 7 show. 
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Example 19.4.5 Diagonalizing a real quadratic form. 


A quadratic form Q on a real vector space F is a real-valued function 
on FE for which there exists a symmetric bilinear form 6 on E for which 
Q(x) = b(x,x), for all x € E. Suppose that EF is a Euclidean space. Then 
since Q(z + h) = Q(x) + 2b(x, h) + b(h,h), and since b(h, h) = o(|All), Q is 
continuously differentiable, and DQ,(h) = 26(2,h). 


Theorem 19.4.6 Suppose that Q is a real quadratic form on a 
d-dimensional Euclidean space E, defined by a symmetric bilinear function 
b. Then there exist an orthonormal basis (e1,...,ea) of E and real numbers 
Ay > Ag > +++ > Aq such that 


Q(x = yr? +--+ + Agr for all x = x,e, +--+ + 2¢eq € E. 
1 d 


Proof The proof is by induction on d. The result holds if d = 1. Suppose 
that it holds for d—1, and that Q is a quadratic form on a d-dimensional 
Euclidean space E. Let S¢~! be the unit sphere 


St) = {x € E: g(x) = |le||? = 1}. 


Since Q is a continuous function on E and S¢~! is compact, Q attains its 
maximum on S¢~! at a point e; of $4. The tangent space T., to S4~! is 
er. Thus 2b(e1, h) = DQ-, (h) = 0, for h € et. Consequently, if « = 21e, +h, 
with h € e+, then Q(x) = A127 + Q(h), where \y = Q(e1). 

We now apply the inductive hypothesis to the (d — 1)-dimensional space 
ey: there exists an orthonormal basis (e2,...,¢q) of et and real numbers 
Ag > +++ > Aq such that 


Q(y) = Ao} + +++ + Aayd for all y = yoeo + +++ + yaea € EY. 


Then (e1,...,€q¢) is an orthonormal basis of FE. Since A; is the supremum 
of Q on eet. Ay > Ag > +++ > Ag and 


Q(x) = Arey +--+ + Agag for all x = x11 +--+ + taea € E. 


We can use this to say more about the relationship between local maxima 
and minima and stationary points. Suppose that f is a twice continuously 
differentiable function on an open subset U of a d-dimensional Euclidean 
space /, and that x is a stationary point of f. Then, for small h in E, 


f(e@+h) = f(a) + D*fe(h,h) + r(h), 
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where r(h) = o(|lh||?). The mapping Q(h) = D?f,(h,h) is a quadratic form 
on F, and so there exist an orthonormal basis (e1,...,¢q) of E and real 
numbers Ay > Ag > ++: > Aq such that 


D? f,(h,h) = Ayh? +--+ + Agh2 for all h = hye, +--+ + hgeg € E. 
There are now five possibilities. 


1. Aj > 0 for 1 <j <d. Then f has a strict local minimum at 2. 

2. A; > Ofor 1 <j <dand Ag = 0. Then f can have a strict local minimum 
at x (consider the function x? + 2$ on R? near (0,0)), a local minimum 
which is not strict (consider the function x? on R? near (0,0)) or can 
take positive and negative values in any neighbourhood of x (consider 
the function x? — 2} on R? near (0,0)). 

3. Ay > 0 and Ag < 0. Then f takes positive and negative values in any 
neighbourhood of x. In this case x is called a saddle point of f: consid- 
eration of the graph of the function f(x1,72) = x? — x2 explains this 
terminology. 

4. 4; <0 for 1 <7 <dand A; = 0. As case 2. 


5. Aj <0 for 1 <j <d. As case 1. 


Similar results hold for constrained stationary points. 


Figure 19.4a. A saddle point. 


Example 19.4.7 Maximum entropy. 


Suppose that P is a probability defined on a set {21,...,2%} of n points, 
and that P(x;) = p; (so that p; > 0 and pj + --- + pn = 1). Suppose that 
f is a continuous strictly concave function on [0,1] which is continuously 
differentiable on (0,1). For what choice of P is F = )7"_, f(p;) amaximum?. 
First note that if we set h(t) = f(t) — f(0) then >°_, h(pj) = F —nf(0), 
and so we can suppose that f(0) = 0. 
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Consider the function F(x) = )7"_, f(xj) on U = (0,1)", the function 
g(x) = 414+---+4n, and the manifold M = {x € U: g(x) = 1}. Suppose 
that p € M is a constrained stationary point of F’. By Corollary 19.4.4 there 
exists A € R such that V(F — Ag)» = 0. That is, 


OF 
f'(p;) = Bn, WP) =Aforl<j<n. 


U5 
Thus f’(p;) = f'(p;) for 1 < i,j <n. Since f is strictly concave, f’ is strictly 
monotonic, and so p; = p; for 1 < i,7 <n. Thus p = (1/n,...,1/n), P is 
the uniform distribution on {x1,...,@%,} and F(x) =nf(1/n). It is easy to 
verify that this is a constrained local maximum. 

We need to verify that this is the maximal value on M. But if y € OM, and 
yj #0 for k values of j, then the same argument shows that dopa flys 
kf(1/k), and kf(1/k) < nf(1/n), since f is strictly concave. 

If P is a probability measure on {21,...,2%,}, then the Shannon entropy 
of P is defined as — 4 p; logy p; (where logs is the logarithm to base 2: 
logs(t) = log t/ log 2, and 0. logy 0 = 0). It follows that the maximum entropy 
occurs when P is the uniform distribution. 


Example 19.4.8 The reflection of light. 


One of the properties of light is that, in a homogeneous medium, it travels 
in straight lines. Suppose that M is a closed (d— 1)-dimensional differential 
manifold in a d-dimensional Euclidean space EF, and that P and Q are points 
in FE for which the line segment [P,Q] is disjoint from M. What is the 
shortest path from P to Q which includes a point of M? This is a question 
which can be solved using a Lagrange multiplier, but not quite in the way 
that might be expected. Let us give a concrete example. 

Let U = {x €R?: 2, >0,22 >0} and consider the two-dimensional differ- 
ential manifold M = {x € U: f(x) = x12 = 3V2} (the ‘mirror’) in U. 
Let P = (—1,0,0) and Q = (1,0,0). What is the shortest path from P to Q 
which includes a point of 17? We need some results from Euclidean geometry. 
Suppose that a > 1. Then the set Ey = {x € R?: ||z — P||+ ||2 — Q]| = 2a} 
is the ellipsoid 


3, = 
{gE R": g(z) = 5 2-1 


1L=01, 

which is a compact two-dimensional differential manifold in R°. (See 
Exercise 19.4.4.) Where does the function f attain its constrained maximum 
on E,g™U? By Corollary 19.4.4, if x is a constrained stationary point, there 
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exists a unique A in R such that V(f — Ag)z = 0. That is, 


2021 2A0X2 2023 
_ = 0, — = 0, = 0. 
“2 a2 ag az —1 
Thus 
2x? 2x2 
£3 and 1% a a — 1’ 
so that 
2 2 2/2 
a a = 5 and 2123 = —{—_ 


We require x to be a point of M. This happens if a = 3, from which it 
follows that 


r= 3V2/2, ro = 2 and 73 = 0, 
and the length of the path is 6. Note that we do not need to calculate X. 


XjXq = 3\2 


Figure 19.4b. Reflection in a hyperbolic mirror. 


Why are the multipliers called Lagrange multipliers? During the sec- 
ond half of the eighteenth century, the Piedmontese mathematician Joseph 
Lagrange developed a new formulation of Newtonian mechanics. Sup- 
pose that we are considering an ensemble of N particles P,,... Px, with 
masses m 1,...™my. In the Newtonian formulation, we consider the positions 
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Z1,...xy, and consider the second-order equations 


o ; 
qa (is) = F;, where F; is the force on P;. 
The positions 71,...,2y are represented as a point in R®’, each particle 


having three coordinates. One of Lagrange’s insights was that it is help- 
ful to consider the particles’ velocities, and their coordinates, on an equal 
footing. Thus each particle is represented by six coordinates, with three 
for its position and three for its velocity. Another coordinate is needed 
for time. Thus the configuration is in R°V+! = X x W x T, with basis 
(e1,..-,€3N, f1,---;f3n,h), where the jth particle has 


position 9) = ©35-2€3j-2 + %3;-1€3j;-1 + ©35€3; 


and velocity vy) = U33-2f3j-2 + U3j-1f 37-1 + U3; F35, 


and hf is the unit vector in the time direction. It follows from Newton’s laws 
that if there are no constraints on the variables then the equations of motion 
are given by 
OL _d/(OL 
On; i dt Ov; 
where L is the Lagrangian, defined as L(z,v,t) = T(a,v,t) — V(a,v,t), 
where T(x, v,t) = soy mvs is the kinetic energy of the ensemble, and 


) and vj = for <s<n, 


V(za,v,t) is its potential energy. We consider these equations in an open 
subset U = Ux x Uw x I of R®N*!, 

Let J : X — W be the linear isomorphism of X onto W defined by 
J(e;) = fj for 1 < 7 < N. We can then write the equations of motion in 
terms of the gradients Vx in X and Vw in W. They become the equation 


HV xb) = L(VwL). 


There may however be constraints of various kinds (for example, the 
distance between two particles may remain constant). We consider only 
holonomic constraints, which only involve the positions of the particles, but 
neither their velocities nor time, and which are given by a submersion g of 
Ux into a Euclidean space F'. Thus Mx = {x € Ux : g(x) = 0} is a manifold 
in Ux, and M = Mx x Uw x I is a manifold in R®+!. If « © Mx, let T, 
be the tangent space at x, let Vr, be the corresponding gradient in T),, and 
let J, be the restriction of J to T;,. Then the equation of motion becomes 


d 
Je(Ver,L) = 5 (Vw). 
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Then, arguing as in Theorem 19.4.3, there exists a unique ¢, € F' such that 


Je Vx(L~ (62,9))) = S(VwL). 


Suppose that F = R*, that g = (m,...,gg) and that ¢, = (A1,...,Az). 
Then the equations of motion become 


OL d peks 


2a 
Ni 1<97<N. 
Oa; ~ dt (Fy? “yas Ss j 


The Lagrange multipliers may vary with time, but their time derivatives do 
not enter into the equations. Let us give a simple example. 


Example 19.4.9 The pendulum in three dimensions. 


Suppose that a single particle, of mass m, is attached by a light rod of 
length / to a fixed point, which we take to be the origin in R, and swings 
freely, under the influence of gravity. We take rectilinear coordinates, with x3 
in the vertical direction. Then T(x, v) = 3m(v? + vf + v3), V(az,v) = mgx3 
and the constraint g is given by x? + 23 + a: = [?. The equations of motion 
then become 


dv 
—m— = 2 
dt a 
du 
—m— = 2AX2, 
dv3 
—mg — nm = 2X 


In fact there are solutions for which v3 = 0, so that x3 is constant. Then 
A = —mg/2zx3, so that A is constant. We require mg < 2Al so that —l < 
x3 < 0. Let w = \/2\/m. Then a solution is given by x; = Acoswt and 
= Asinwt, where A? = [? — 22. 


Exercises 


19.4.1 Suppose that T is a self-adjoint linear operator on a d-dimensional 
Euclidean space &. Show that there exists an orthonormal basis 
(e1,...@qa) and real numbers A; > A2 >... > Ag such that T(e;) = 
Ajej for 1 <9 <d. 

19.4.2 A linear operator on a complex inner product space F' is self-adjoint 
if (T(x), y) = (x, T(y)) for x,y € F. Suppose that T is a self-adjoint 
linear operator on a d-dimensional complex inner product space F’. 
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19.4.3 


19.4.4 


19.4.5 


19.4.6 
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Show that (T(x), x) is real, for all x € F’. Show that there exist an 
orthonormal basis (e;,...eq) and real numbers Ay > A2 >... > Aa 
such that T(e;) = A;e; for 1 <9 < d. 
Let I be the plane {x € R® : 2\/221/3 + xo = 4}. Let P = (1,0,0), 
Q = (—1,0,0). Determine the minimum of 
{||2 — P| + |e — Ql] : & € TT}. 
What is the geometric significance of your result? 
Let P = (1,0,0), Q = (—1,0,0). Suppose that a > 1. Let 
Tg? = {(a?, 22,23) : 22,23 € R} and 
T_g2 = {(—a*, #2, 23) : 22,23 € R}. 
Determine the sets of points 
{x € R? : a||x — P|| = d(x, T,2)} and 
{x € R® : a||x— Q|| = d(x, _a2)}. 
Deduce that 
2 


2 2 
9 o£ +x 
{a € R®: |lv— P| + |v — Q|| = 2a} = {a RB: S + 8 = 1}. 


By considering the tension 7 in the light rod, establish the circu- 


lar movement of a pendulum as described above, using Newtonian 
methods. How are T and the Lagrangian multiplier \ related? 
Suppose that p,q > 0 and that 1/p+1/q=1. 

(i) Let Sy ={zeR?: Sa |x;|2 = 1}. Show that My is a compact 

differential manifold in R?. 

(ii) Suppose that a; > 0 for 1 < j < d. Let f(x) = yo 4 uae. 
Use a Lagrange multiplier to find the point x € Sg at which f 
attains its constrained maximum on Sq, and find the constrained 
maximum. 


(iii) Deduce Holder’s inequality 
d d 1/p d 1/q 
Sllebl<s [Solar] -[ Solty, 
j=l j=l j=l 
and show that 
1/p 


d d d 
eS |a,;|P = sup S 45h, : » lz? <1 
j=l j=l j=l 
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(iv) Deduce Minkowski’s inequality: 


1/p ¥ 1/p , 1/p 


d 

d 
So lag +P | <[>- lay + [>= la? 
g=1 j=l j=1 


19.4.7 Suppose that p,,...,pq are positive numbers for which i Ljpg = 1; 
(i) Show that 


A= {x eER?: pe |; |? 


ja PS 


is a compact differential manifold in R?. 

(ii) Let p(x) = ea x;. Use a Lagrange multiplier to find the point 
x € A at which p attains its constrained maximum on S,, and 
find the constrained maximum. 

(iii) Establish the generalized arithmetic mean-geometric mean 
inequality: if a; > 0 for 1 <j <d, then 


d 
1/pr 1/pa 
ay... Ag = \  ayp;, 


19.4.8 If R is the point of M in Example 19.4.8 for which the path PRQ has 
minimal length, show that PR and QR make the same angle with 
vectors normal to M at R. (Consider a reflection in the tangent space 
at R.) 


19.5 Smooth partitions of unity 


The definition of a differential manifold is a local one. When we turn to inte- 
gration, we need to combine local results. For this, we use smooth partitions 
of unity. 

Suppose that K is a compact non-empty subset of a Euclidean space E, 
and that {U,,...,U;,} is a finite open cover of kK. A smooth partition of 
unity subordinate to the cover is 


e an open subset V such that kK C V C E105, 
e a sequence (L5)F 4 of compact sets, such that L; C U; for 1 <j <k, and 
VC UF_jL;, and 
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Figure 19.5a. Partition of unity. 


e a sequence ( fie of smooth non-negative functions on V such that 
fj(x) = 0 for « € V \ Lj, and such that pe ie) = lier ee Vv 


Theorem 19.5.1 Jf K is a compact non-empty subset of a d-dimensional 
Euclidean space E and {Uj,...,U,} is an open cover of K, there exists a 
smooth partition of unity subordinate to the cover. 


Proof We break the proof into several steps. Let U = UF _ Uj. 


Lemma 19.5.2 Suppose that x € U;. There exists a compact subset Ly of 
U; and a non-negative smooth function hz on U such that hz(x) > 0 and 
such that hz(y) =0 for y €U \ Ly. 


Proof Let (e1,...,eq) be an orthonormal basis for E. There exists 6 > 0 
such that Lz = {y € E : |y,—2a;| <6 forl <i < d} C Uj. fy € U and 
1<j<hk, let 


hee(y) = { SPOME — lve 2e!?)) for ls — 2 <4, 
ia lY 0 otherwise. 


As in Volume I, Section 7.6, hi,. is a smooth function on U. Let hy = 
nee hi. Then Lz and hy satisfy the requirements of the lemma. 


Lemma 19.5.3 There exist compact sets {Kj : 1 < j < k} such that 
Kk; CKNU; for 1 <j <k and such that K = Us_ Ky. 


Proof Since Kk is bounded, we can suppose that each U; is bounded. Let 


Ujn = {y = U; : d(y, EB \ U;) > 1/n}. 
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yA 


a. 08 0 0.5 1 


Figure 19.5b. A bump function in one dimension. 


Then Uj, CG U;, each sequence (U;,)?2 is increasing, and the sets {Uj,n : 
1<j<k,n € N} form an open cover of kK. There therefore exists N ¢ N 
such that K C Uf_,Uj.w. Let Kj = K Uj. Then {Kj :1 <j < k} 


satisfies the requirements of the lemma. 


Lemma 19.5.4 For each 1 <j < k there exists a compact set L; such 
that Kk; GC L; C Uj; and a smooth non-negative function h; on U such that 
h(x) > 0 forx € K; and h;(y) =0 fory €U \ L;. 


Proof For each x in K; there exist a compact set L, and a smooth non- 
negative function h, on U which satisfy the conditions of Lemma 19.5.2. Let 
Uz = {y €U : he(y) > 0}. Then Uz C Lz, and {Uz : « € K;} is an open 
cover of K;. There exists a finite subcover {U, : x € Fj}. Let Lj = User, Le 
and let hj = \l,epha. Then L; and h; satisfy the requirements of the 


lemma. 


We now complete the proof of the theorem. Let h = YS hj, and let 
V = {y €U: h(y) > O}. Then V is an open set, and K C V CU. Let 


fi(y) = hy(y)/h(y), for y € V. Then (V, (L;)#_1, (f;)$=1) is a partition of 


unity subordinate to the cover. 


Suppose that K is a compact non-empty subset of a Euclidean space E, 
that {U;,...,U;} is a finite open cover of K and that (V, (L;)F4, (Fs )R-1) is 
a smooth partition of unity subordinate to the cover. Suppose further that 
F is a Euclidean space and that, for each 1 < j7 < k, gj is a function from 
U; to F. We then define ae fj9;:V — F by setting 


k 
Y= figi(y) = SL fi(y)gi(y) sy € V OU;}. 
j=l 
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In other words, we set (fj9;)(y) = 0 if f;(y) = 0. Then the function 
ee fj9; is continuous (continuously differentiable, smooth) if each of the 
functions g; is. 


Exercise 


19.5.1 Suppose that FE is a Euclidean space and that 6 > 0. Let 


ny) < £ PCCW = lvl) for Ill <5 
0 otherwise. 


Show that h is a non-negative smooth function on E. Identify the set 
{y : h(y) > Of. 


19.6 Integration over hypersurfaces 


We now turn to integration. We restrict attention to a special case; through- 
out this section, we suppose that M is a connected compact d-dimensional 
hypersurface in a (d + 1)-dimensional Euclidean space E. We need one 
topological property of such hypersurfaces, which we state without proof. 


Theorem 19.6.1 (The Jordan—Brouwer separation theorem) Suppose 
that M is a connected compact d-dimensional hypersurface in a (d + 1)- 
dimensional Euclidean space E. Then the open set E\ M has two connected 
components: one, out|M], the outside of M, is unbounded, and the other, 
in|M], the inside of M, is bounded. 


When d = 1, so that M is the track of a simple closed curve in a two- 
dimensional Euclidean space, this is a special case of the Jordan curve 
theorem, which is proved in Volume III. 

If M is a connected compact d-dimensional hypersurface and x € M then 
the normal space N,, is one-dimensional, and so there are two elements nt 
and n; in N, of norm 1. We choose nz so that x + Anz is outside M for 
small positive values of A; then «+ An; is inside M for small positive values 
of X. 

Theorem 19.6.1 has the following useful consequence. 


Theorem 19.6.2 Suppose that M is a connected compact hypersurface 
in a Euclidean space E. There exists an open set W containing M, and a 
submersion g: W + R such that M = {y © W: g(y) =O}. 


Proof Since M is compact, there exist a finite open cover {Uj,...,U,} of 
M by open subsets of E and submersions g; : U; — R such that MMU; = 
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{y € U; : gj(y) = 0}. We can also suppose that MMU; is a connected subset 
of M, for each j. If x € MN Uj, let 


dj(x) = ((V 95) Me) = ss (2). 


Then A; is a continuous function on MMU;, which does not take the value 0. 
Replacing g; by —g; if necessary, we can therefore suppose that A;(a) > 0 for 
all ¢ € MNU;j. There exists a smooth partition of unity (V, (L;)F4, (f;)R-1) 


subordinate to the cover. Let g = ey fjg;. Then M = {y € V:: g(y) = O}. 
If « € M, there exists i such that « € MMU; and f;(x) > 0. Then 


(Von, nz) = SH > fi(w)di(x) > 0. 


Thus Vg is non-zero on M. Let W = {y € V : Voy # 0}. Then W is an open 
subset of V containing M, and the restriction of g to W is a submersion. 


We now consider integration. First, let us describe the notation that will 
be used. If f is a Riemann ae function on an open subset U of E, we 
denote the integral by tet y) dva+i(y). On the other hand, the integral of 
a function g on M will be eel by fy g(x) doa(x): thus dug,1 represents 
a (hyper)volume integral and dog represents a (hyper)surface integral. 

We begin locally. Suppose that « € M. Take an orthonormal basis 
(€1,.--,;€g41) for E, where (e1,...,¢€q) is an orthonormal basis for T,, and 
€qi1 = nt. We shall consider cells in F defined in terms of this basis. There 
exists 6 > 0 such that if U, is the cell 


{ye EF: |y;-—2;|< 6 forl1<j <d+1} 
and ¢z(y) = P.(y — x) for y € M then 


e (Uz, ¢z) is a chart near z, 
¢ |jnt - nel < } for ye MNUz, so that (nt, nt) > 4, and 

© |l(y—2) - ae YI < |léx(y)|| for y € MNU;, (this is possible by Corollary 
19.2.4). 


Let w : d:(Uz) > MOU, be the inverse mapping, and let 
Ve= {0 € 7, |u| <4 for 1 < 7 < a}. 
It follows from the third condition that if y € MMU, then 


IYa+1 — Za+1| = Il(y — x) — bx(y)|| < max{|yj; — 2j|:1 <7 < d}, 
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and it follows easily from this that ¢,(Uz,) = Vr. 
We call such a cell Uz a well-behaved neighbourhood of x. 
Suppose now that 0 < a < 6, that 


Ly ={y¢€ E:|y; -—2;|< a forl1<j<d+1}, 


and that g is a continuous function on M which vanishes on M \ L,. Suppose 
that y € MOL, and let ¢,(y) = v. We now choose a new orthonormal 
basis (f1,..-, fa41) such that (fi,..., fa) is an orthonormal basis for T,, 
joa = thy = mp and ay € span{ fa, fazi}. Suppose that h is a unit 
vector in T,. There exists 7 > 0 such that v+th € V, for |t| < 7. Then 
w(u+th) € M for |t| < 7, and so Dy,(h) € Ty. Thus the restriction of Dy, 
to T;, is a linear mapping from T;, to Ty, and so the Jacobian J7, is defined. 


We now define 


J su)acaty) to bef g(u(v))|¥| dua. 

Mrw, : 

Let us determine the value of |Jy,|. Let C = I, x --- x Ig be a cell in T,, 
defined in terms of the new basis nt vng Japi)> Then Dy, (C) is a cell in 7, 
with sides of lengths I(11),...4(Ia-1),'(Ia)/ (nf ,nt) (see Figure 19.6), and 


so |.Jey| = 1/ (nj, nz). Thus 


| vay, Sed ea) = [ . Toko) dva(v). 


It is now a straightforward matter to use a smooth partition of unity 
to define the integral of a continuous function g on M. The well-behaved 
neighbourhoods U, of points « of M form an open cover of M, and so 
there is a finite subcover {Uz,,..., Ux, }. Let (V, (L:)*,, (fi)£,) be a smooth 


Figure 19.6. Change of variables. 


19.6 Integration over hypersurfaces 571 


partition of unity subordinate to the cover. We then define 


k 


oy ( [_, wna) toa) 7 


i=1 


| g(y) doa(y) to be 
M 


In particular, we write oq(M) for [,, doa(y), and write wy for oq(S%), 


where S$? is the unit sphere in a (d+ 1)-dimensional Euclidean space. 


It does however remain to show that the integral does not depend on the 


finite subcover, nor on the choice of smooth partition of unity. This is again 


a fairly straightforward matter. The details are indicated in the exercises 


below. 


Exercises 


We consider the setting described above. 


19.6.1 


19.6.2 


19.6.3 


19.6.4 


Suppose that z € MM U,. Show that i as) = $ for y Ee MNU,. 
Show that if we follow the procedure above, projecting onto T, rather 
than T;,, then 


| gly) doa(y) = | (Hz (0))|(Jbz)ul dva(v). 
MU; 


z 


Suppose that {W,,,,..., Wz, } is a finite cover of ZL, by open cells and 
that (V, (L;)*,, (fi)£,) is a smooth partition of unity subordinate 
to the cover. Show that 


/ o(y) doa(y) = 
MnuU, 


Suppose that {W,,,...,Wz2,} is a finite cover of M by open cells 
with the properties described above and that (V, (Li)*_,, (fi)£) is 
a smooth partition of unity subordinate to the cover. By considering 
the open cover {U; NW; :1<i< k,l <j < Uj, and using the 
previous exercises, show that J, 9(y) doa(y) does not depend on the 


k 


2 ( i; vow, vate toa) | 


i=l 


choice of cover or the choice of smooth partition of unity. 

The following extended exercise shows how to establish a change of 
variables formula, in the simplest, but most useful, case. We con- 
sider the unit ball By and the unit sphere S¢~! in a d-dimensional 
Euclidean space FE. Let A, be the annulus {x : r < ||z|| < 1}, for 
0 <r <1. Suppose that f is a continuous real-valued function on 


S¢-! and that g is a continuous function on By. Let f(rw) = f(w) 
forO<r<landwe S?!. 
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(a) Show that f is a continuous function on Bg \ {0}. 
(b) Show that 


i) eet )dogi(w =f F(a) dvg(a 
(c) Suppose that € > 0. Show that there exist 0 = 19 <--+ <r, =1 
and a function h on By \ {0} such that 
(i) the function hj(w) = h(rjw) is continuous on S41, for 1 < 
j<k, 
(ii) h(tw) = h(rjw) for w € S41 and rj_-1 <t<r;, forl<j< 
k, and 
(iii) |A(x) — g(a)| < € for x € By \ {0}. 
(d) Show that 


[ g(x) dvq(x) = [ Ga fle) dow-alw)) is: 


r 


[a dvag(x) = i Ga ghee doa-a(w)) ae 


19.6.5 Use the previous exercise to calculate wg. 

19.6.6 Suppose that U is a bounded convex open neighbourhood of 0 in a 
d-dimensional Euclidean space FE, that M = OU is a (d — 1)- 
dimensional differential manifold such that for each x € M, (x,nt) > 
0. Suppose that f is a continuous real-valued function on U and that 


(e) 


g is a continuous real-valued function on U. 

(a) Show that for each w € S¢! there exists a unique r,, > 0 such 
that 2 =ryw € M. Let f(w) = f(zy). (Note that r, = |||.) 

(b) Show that 


_f eal’ Fe) 
| fla) doaa(e) = f J wu), 


(c) Obtain a formula for J g(x) dva(x) corresponding to the formula 
in Exercise 19.6.4. 


19.7 The divergence theorem 


Suppose that U is an open subset of a Euclidean space EF. A vector field F’ 
on U is a continuously differentiable mapping from U into E. For example, 
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if fisa C®) function on U , then the gradient mapping Vf is a vector 
field. If F is a vector field on U for which there exists a C) function f, for 
which F = Vf, then F is called a conservative vector field, and f is a scalar 
potential for F. Another important example occurs in R?. Suppose that U 
is an open subset of R® and that f = (f1, fo, fg) is a C®) vector field on 
U. Let 


wx Ate) = (F2@-2e, Ae - Fe. 20 - se). 


Then V x f is a vector field, the curl of f, on U. Compare this with the cross- 
product of two vectors defined in Appendix C. Such vector fields occur in 
mathematical physics, in electromagnetic theory, in gravitation and in fluid 
dynamics. We consider curl further in Section 19.9. 

If F' is a vector field on an open subset U of a Euclidean space EF and 
x €U, then DF, is a linear mapping of F into itself, and DF is a continuous 
mapping of U into L(E). We define the divergence V.F(x) at x to be the 
trace of DF,. If (e1,...,ea) is a basis for E, and if F(x) = yy Filz le. 
then DF is represented by the matrix (Of;/0x;), so that 


It is important to note that this formula does not depend upon the choice of 
basis (and that the basis need not be an orthonormal basis), since the trace 
does not depend on the choice of basis. (See Appendix B.5.) A vector field 
F is said to be solenoidal if V.F = 0. For example, if f is a C®@) function 
on U then 


so that V x f is solenoidal. 
Suppose that f is a C®) function on U. Let F = Vf. Then f is harmonic 
if F is solenoidal; that is, if V.(Vf) =0. We write V?f for V.(Vf): V7f is 
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the Laplacian of f. If (e1,...,ea) is an orthonormal basis for FE, then 
d 
af 
2 = 
V° f(z) = By2 (*): 


Let us give two examples which we shall need later. 


Example 19.7.1 Suppose that EF is a d-dimensional Euclidean space and 
that a € R. 
(i) Let a(x) = |\a||?-°, for « € E \ {0}. Then 


2—a)(d— 
Vwa(z) = = VF \\° and V7 vax )= _ ae 
(ii) Let d(x) = log ||z|| for « € E \ {0}. Then 
d—2 
V(x) = — and V?(2) = 5 
|| || 
These are easy calculations. If (e1, +104) is an orthonormal basis for E 
and x = eae xe; then Wa(x) = (x 24 4+ 22)!-4/2) so that 
OVa x 
—(xr) = (2- and Vw = (2—a) 
ae; °) = @- e)pat C- OTF 


Further, 
ry ( 1 x ) 
aot) =2=0)(—— 3 -o— 
an? =O 9) \ Tape ae? 


Adding, V*Wa(x) = (2 — a)(d — a) ||a||~* 

The calculations for (ii) are left as an exercise for the reader. 

In particular, the function / |||? on E \ {0} is solenoidal. If d = 2 then 
@ is harmonic, and if d > 2 then wg is harmonic. 

We shall consider these ideas further in the next section. First, we 
prove the divergence theorem, which is a multi-dimensional version of the 
fundamental theorem of calculus. 


Theorem 19.7.2 (The divergence theorem) Suppose that V is a connected 
bounded open subset of a (d+1)-dimensional Euclidean space E whose bound- 
ary M is a finite disjoint union of connected hypersurfaces My,..., My and 
that F is a vector field defined on an open set U containing V. Let B=V, 
and let 


k 


a (F(z), nz) doa(x) a > (/, (Pia). ) iat) ) 


j=l 
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where nz is the normal at x in the direction out of V. Then 


| V.F'(y)dva+i(y) =) (P@) ns ) doa(z). 
B M 


Proof Note that we need to show that B is Jordan measurable, so that the 
first integral makes sense. This will emerge during the proof of the theorem. 

We use a smooth partition of unity. For each x € M there exists a well- 
behaved neighbourhood U, of x, and for each y inside M there exists an 
open cell Cy such that Cs C in[M]. Together, these sets form an open cover 
of B; since B is compact, there exists a finite subcover 


(Usiss : alas C. 


Yrt12°* 9 


Cy,) = (V1,..- Vi) say. 


We consider a smooth partition of unity (V, (L1,...,Z7), (f1,.-. fy) subor- 
dinate to the cover. Then 


l 
a V-Flaldeasi() = Do [ faly)V-F(y)dvays(y) 


and 


(F(x),nt) doa(z -y “Dow, F(a),nz) dog(a). 


Now let us set G; = f;F. Since f;(y) = 0 for y ¢ Lj, we consider G; as a 
function on EF, setting G;(y) = 0 for y ¢ L;. Then 


V.G3(y) = (VEY), FY) + HWV-FY) 


M 


so that 
1 


I l 
\_\V.G; = (3 Vint) +|Sof | VPHV-F, 
j=l i=l j 


since YS fj = 1 and 2S Vf; = V1=0. It is therefore sufficient to show 
that 


[ VGi)dowsty) =f (Gile).nz) doe) 
Bow, M(Uz, 


for 1 <j <k, and that 
| ¥-Gitu deassly) dy = 0, 


fork+1<j<l. 
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We deal with the second set of equations first. By taking a suitable 
orthonormal basis of £, we can suppose that 


Ve =1wek: ay — (yy <0 for 1a 1} 
and L; = {w ¢ EB: \wi— (y;)i| Sa for 1 <i <d+ I}. 


Let G; = (91,---9a41). Then 


Oi Wigsve A Ug)a > exis WA) — OW sey (GEM Oh Wa) = 0; 


Adding, it follows that 
; VOGinut) =O. 


We now turn to the first set of equations. To simplify the notation, we 
can suppose that x; = 0. We drop the suffix j; we denote U,, by U, Lz, by 
L, the tangent space T;, by T, the orthogonal projection onto T by P, the 
restriction of P to MMU by 4¢, and the inverse mapping of ¢(M MU) onto 
MOU by w. 

It follows from Theorem 18.5.3 that BOU is Jordan measurable, so that 
the volume integral over BM U exists. Since U is a well-behaved neighbour- 
hood of 0, there is an orthonormal basis (er, ..+,€q41), where (€1,...,€q) 
is an orthonormal basis for T,, and egy; =nz We write G;(y) = G(y) = 


eu, gi(y)e;. Then 


d+1 29% 
V.G(y) du | “(y) du y 
(y) dvayi(y 2 Ee Dy, U) Waray) 
and 
d+1 


has SCC) ag Gale = ec ) (eine) doa(s). 
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It is therefore sufficient to show that 


| az (y) dvati(y) = / gi(x) (ei,nz) dog(x) 
B 


nu Ox; MnU 


for l1<i<d-+l. 
We make a (non-linear) change of variables. If y € U, let y(y) = w(P(y)) 
and let 6(y) = x(y) — P(y). If y = 2 + Aegy1, with z € T, then 


O(y) = (2) = (2) — 2 € span (ed41). 


From this it follows that 


Me Geng oO =0forl<i<d1l<j<d+1. 
OXd+1 Ox; 


Now let S(y) = y — @(y). Then S(M MU) CT, S is a diffeomorphism of 
BoU onto S(BNOU) and S(BNU) C T x (—26,0). Further, it follows 
from the equations above that the Jacobian J(S') = 1. Let R be the inverse 
mapping from S(U) onto U. It is then sufficient to show that 


—26,0] Ox; 


| Ogi (R(y)) dvayi(y) = i ot) Aeron) daa: 
Tx|[ . ( 


for l<i<d-+l. 
First, let us consider the case where 1 = d+ 1. Then 


OX. 


O Ogg. 
[( Za (Ry) dias) Qigens Ua 


~2§ Od41 


BH 


i ous (R(y)) dvasi(y) = 
Tx [—26,0] 


= i ga+i(b(w)) doa(w) 
T 


: (casa, Bea) 
a a Cee 


Next suppose that 1 <i < d. Without loss of generality, we can suppose 
that 7 = d. First we fix all the variables y, with 1 < k < d. Suppose that 
Yk = ay forl <k<d.Ify €U, we set y = (a,s,t) and set w = (a, s,0). Let 
II be the plane 


dog(w). 


{ye E: yy, =a, for 1<k< d}, 
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and let U =UN, M=(MNU)OIL, B=(BNU)NIL. Then 


M = {(a,s,t) : t = (w(s),ea41)} and BC {(a,s,t) :t < v(s)}. 


=: -26 
Figure 19.7. The divergence theorem. 


Suppose that y = (a,s,t) € S(B). Let U = {(s,t) : (a,s,t) € S(U)}. 
If y = (a,s,t) € S(U), let h(s,t) = ga(R(y)), and if (s,t) € R2\ U let 
h(s,t) = 0. Then the partition of unity properties imply that h is continu- 
ously differentiable, and that h(d,t) = h(—6,t) =0 for t € R. If (s,t) € U, 
— ah a a OR 

5 (St) = Fe(R(y)) + S*(RW)) F—(). 


Now let Mw) = pps An(s)€~, where Az(s) = (CLR ay ) Since U is a 
well-behaved neighbourhood of 0, Agii(s) > 5: 


Let (u,v) be the unit tangent vector to the curve (s,h(s,t)), with u > 0. 
Then (0, u,v) is in the tangent space of R(T) at R(y). But this is the same 


as the tangent space of M at w(w), and so ((0,4, 0), ry) ) = 0; that is, 
urdg(s) + vAgii(s) = 0, and so OR/Os(y) = —Ag/Agy1. Now 


6 
one a ds = h(d6,t) — h(—6,t) = 0, 
so that 


Aa(s) O9a 
—5 Ad+i(8) Ot 


(R(a, s, t)) ds. 
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Integrating with respect to t, and changing the order of integration, 


[ ( 294 (P(g, s,t)) is) dt = , Aals) ga(w(a, s, 0)) ds 
_§ Os 


—26 _5 Ad+1(s) 


-{ 5 (eaninno) 


ga(w(a, s,0)) ds. 
—46 (east, fiat) 


Integrating over the remaining variables, 


(carbon) q 


29 = w Oq(w 
[ spray Deg) ea) = a galtb(w)) lesan) al). 


There are many consequences of the divergence theorem. 


Corollary 19.7.3 Suppose that M is a d-dimensional connected compact 
hypersurface in a (d+1)-dimensional Euclidean space E, that B = in|M] = 
M Uin|M] and that F is a vector field defined on an open set U containing 
B. Then 


[VF =f (P@.nb) doula). 
B M 


Proof This is a consequence of the Jordan—Brouwer separation theorem: 
M is the boundary of in(/). 


Corollary 19.7.4 If f is a C?)-function defined on U then 


[ Vrerdcas= | P50) doa). 
B M 


Proof Apply the theorem to Vf. 


Corollary 19.7.5 (d+1)vg4i(B) = fa, (eu, nt) dog(z). 


Proof Take F(z) =a. Then V.F =d+1. 


Corollary 19.7.6 (d+1)vai1(B,(y)) = roa(Sp(y)) = r@wa. 


Proof For (x —y,nt) =r. 


Corollary 19.7.7 If y«EV then 


+ 
[, 12M) aga) =e 


t ||z — yoll 


580 Differential manifolds in Euclidean space 


[ = w2 anyno 
M |l2 — yoll 
Proof If yo € E\V then (y—yo)/ ||y — yo||7*! is solenoidal on V, and the 
result follows from the divergence theorem. 
If yo € V, there exists ro > 0 such that if 0 < r < rp then B,(yo) C V. 


For such r, let V; = V \ B,(yo), so that OV, = M U S;,(yo). Since 
(y — yo)/\ly — yol|*** is solenoidal on V,, it follows that 


x — yo, nt L— Yo, Ny 
| (oe Wot dgy(x) = -f (evo) ag, (x), 
m |x — yoll S-(yo) ||z — yoll 


Bearing in mind that in this equation nv is pointing towards yo on S;(yo), 
so that (x — yo, nt) = — |x — yoll, it follows that 


If yo € E\ V then 


ee 


7 
os 


— 


XL — Yo, Ny i 
-f {e Yo, Re? dog(x) = -d7d(Sr(Yo)) = Wd 
S-(yo) [|Z — Yyoll 


Corollary 19.7.8 Ifyo€V then 


d+1/_. 1 tia ae 
c [aaa i P( ) 2) d al ) = V.F(y) 


as Tr \, 0. 


Proof There exists rp such that B,(yo) C V for 0 < r < ro. Applying the 
divergence theorem to the open ball N;,(yo), and using Corollary 19.7.6, 


oe 
va+1(B,(Yyo)) 


eee Ca) he 
~ raa(Sr(yo)) UL (F(a), nz) doa ) ' 


| V.F (y)dva+i(y) 
B,(yo) 


But 
1 


va+1(B;(Yyo)) 


and so the result follows. 


| 4 VP ideas) + V-Flan) a8 7 0 
B,. Yo 
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Note that 
— x nt ong(rz) = at. - ae enue 
oa(Sr(Yo)) = ea )s a) d a ) rg Le (F( ), se d al ) 


is the average value of (F(x),n7) on S,(yo). 
Suppose that f and g are C®) 


Corollary 19.7.9 (Green’s formulae) 
functions on U. Then 


Fa) (a) doate) = ff ((F0v), Vol) + FWV?alu)) dvaea(o) 


and 


[ (fw) V? aly) 7 ay) V?F(y)) dvay.(y). 


Proof Apply the theorem to H = fVg and kK = fVg—gVf: 


V.H = (Vf, V9) + fV'9 and V-K = fV°9—9V"f. 


Corollary 19.7.10 Jf U is an open subset of R® with boundary M con- 
sisting of a finite disjoint union of 2-manifolds, and if f is a C?)-function 


defined on an open set U containing B then 


[7 A@).nb) doa(a) =0. 
M 


Proof Apply the theorem to the solenoidal vector field V x f. 


Exercises 
19.7.1 Establish the formulae in Example 19.7.1 (ii). 
19.7.2 Suppose that G is a vector field on an open set U and that f is a 
continuously differentiable function on U. Show that 
V(FG) = (VF, G) + f(V.G). 
19.7.3 Suppose that f is a C®) function on an open subset U of R?. Show 


that V x (Vf) =0. 
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19.7.4 Suppose that F is a vector field on an open subset of R? for which 
there exists a vector field A (a vector potential) such that F = V x A. 
Show that F' is solenoidal. 


19.8 Harmonic functions 


In the previous sections, we considered functions defined on an open sub- 
space U of a (d + 1)-dimensional Euclidean space, and hypersurfaces of 
dimension d. In this section we change the dimension by 1; we consider 
functions defined on an open subset U of a d-dimensional space. 

Recall that a real-valued function f defined on an open subset U of a 
Euclidean space EF is harmonic if it is twice continuously differentiable, and 
Vet 

Harmonic functions have good averaging properties. Let us introduce 
some notation. If z € S,(a9) we denote by nz, the unit normal vector in 
the direction away from zo. If 0 < s < r we denote by As (xo) the annular 
set {2 € EB: 5 < ||x — zo|| <r}: it has boundary S;(x9) U S,(z0). 


Proposition 19.8.1 Jf f is a harmonic function on an open subset U of 
a d-dimensional Euclidean space E and B,(xo) C U then 


ar - 
i, Fag) dole) = 0. 


Proof Apply the divergence theorem to Vf. 


Theorem 19.8.2 Suppose that f is harmonic on an open subset U of a 
d-dimensional Euclidean space E and that B,(a%o) C U. Then 


ee 
oa—-1(S;(x0)) 


1 
ae ray I. F(a) dag_i(z). 


Proof We deal with the case where d > 2; the proof for d = 2 is essentially 
the same. Let 0 < s < r. Applying Green’s formula to f and wWa(x — x0) = 
lz — xoll?-%, 


jay= [ yf deae) 


0= | aly — 20) VF (y) — f(y) V2valy) dvaly) 
As,r 


= I, = Te, 


19.8 Harmonic functions 583 


where 


mS 
lI 


of Oa 
= (va 7 to) 5, ) = F(a) a Ce r0)) dog_1(2) 


-4 | 7 (ZA te) - @-afte)) dowalo 


-1? [ yh -10) 


using Proposition 19.8.1 and the equation 


OW 2—d)r 2-d 
Similarly. 
— i Cz = ooh (x) — fo) (x — ) dog_1(2) 


d—2 
= a f(x) dog_1(x). 


Since I, — (d — 2) f(x9)og_1(S*!) as s \, 0, the result follows. 


This theorem has the following consequence. 


Theorem 19.8.3 Suppose that f is a non-constant harmonic function on 
a connected open subset U of a d-dimensional Euclidean space E. Then f 
has no local maximum or minimum. 


Proof Suppose that f has a local maximum at yo and that f(yo) = a. Let 
F={yc€U: f(y) =a}. Since f is continuous, F is a closed subset of U. 
We show that F' is open. Suppose that y € F’. There exists r > 0 such that 
B,(y) C U. Suppose that z € B,(y) and that ||z — y|| = s < r. Then f(y) 
is the average value of f on S;(y). Since f is continuous and f(y) > f(w) 
for w € S;(y), it follows that f(w) = f(y) for all w € S,(y). In particular, 
f(z) = f(y) =a. Thus B,(y) C F, and so F is open. Since U is connected, 
F =U, and f is constant. 
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We now consider the simplest form of the Dirichlet problem: if f is a 
continuous function on the unit sphere S¢! in R4, is there a continuous 
function u on the closed unit ball Bg which is harmonic on the open unit 
ball Ug and is equal to f on S¢~!? For this, we need the Poisson kernel. 
This is defined as 


2 
1—|IyIl 


a for x € S41 y € UG. 
wa-1 lly — 2| 


P,(y) = 


Note that P,(0) = 1/wg_-1. 
Proposition 19.8.4 P,(y) is a harmonic function of y in Ug. 


Proof We prove this by a direct calculation. Let v(y) = 1— ||y||? and let 
w(y) = 1/|le — yl|@. Then 


Du; (y) = —2yi, 
V7u(y) = —2d, 
Ow _ —d(y; — x4) 
ay = Ty — al 
Ow —d d(d + 2)(y; — 24)? 
Oy) = Tyna? yal 
Boeke . d(d+2) 2d 
and VO) = Ta? * yal yn el™ 


—2d 4d(y,y— 2x) . 2d(1—|ly|l?) 


= d d+2 d+2 
lly — 2 lly — 2l|°" lly — all" 
2d 2 
= Tye (lly al +2 (9 — 2) + — Ml )) = 0. 


Theorem 19.8.5 (i) P;(y) >0 fora € St!, y € Ug. 

(ii) If x € Sg_1 and 6 > 0 then P,(rz) + 0 uniformly on {z € S41: 
Iz —a|| > 6} asr Al. 

(it) foe. Poly) dog_1(e) = 1 for y € Ug. 
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Proof (i) and (ii) follow from inspection of the definition of the Poisson 
kernel. Suppose that y € U, and that y = rz, with z € S71. Since P,(y) is 
a harmonic function of y, 
1=wg4F,(0) = PAra)dogi (2). 
Sd-1 
But ||/rx — z|| = ||x — rz||, so that P,(rx) = P,(rz) = P,(y), and so 


i [ __ Pely) doa (2) 


Theorem 19.8.6 (Solution of the Dirichlet problem) Suppose that f is a 
continuous function on the unit sphere St! in R%. There exists a continuous 
function u on the closed unit ball Bg which is harmonic on the open unit 
ball Ug and is equal to f on St. 


Proof Let 
iii Soa f(@)Pe(y) dog_1(x) for y € U, 
f(x) for 2 € $41. 
Differentiating twice under the integral sign, we see that 
Veuly) = ff (2)V*P2(y) doa1(x) = 0, for y € U, 
Sd-1 


so that u is harmonic. It remains to show that u is continuous on B. It is 
certainly continuous on U. Let u,(x) = u(rx) for ¢ € S&! and 0 <r <1. 
Since each u, is continuous, it is sufficient, by the general principal of uniform 
convergence, to show that u, + f uniformly on S¢!. Suppose that € > 0. 
Since S¢-! is compact, f is uniformly continuous on S¢!, and so there 
exists 6 > 0 such that if ||z — z|| < 6 then | f(z) — f(x)| < €/3. By Theorem 
19.8.5 there exists 0 < ro < 1 such that |P,(rz)| < €/(3||f||,, wa_1) for 
ro <r <1 and |lz—2|| > 6. If g € St! and ro < r < 1 then, using the 
results of Theorem 19.8.5, 


Fa) —ur(a)l=| fh (F(@) — @))Palra) dow-a(e) 


< / aye f) ~ FOMPE(rD) doa ale) 


+f (f(z) + |f(2)|) Pra (2) doa-1 (2) 
l|z—all>6 


< €/3 + 26/3 =e. 
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Of course, we can prove a corresponding result for every ball B,(z). 
We now have a converse to Theorem 19.8.2. 


Theorem 19.8.7 Suppose that f is a continuous function on an open 
subset U of a Euclidean space E with the property that for every xp € U and 
€ > 0 there exists 0 <r <e such that B,(ao) CU and 


__ #.. 
oa—1(S;(x0)) 


Then f is harmonic on U. 


ene [ ag fH) eeeil@) 009 


Proof Suppose that xp € U, and let r > 0 be chosen so that B,(a9) C U 
and (x) holds. Let u be the solution to the Dirichlet problem for B,(xo), 
and let v = f —u. Then v(x) = 0 for x € S,(xq), and, since harmonicity is a 
local property, it is sufficient to show that u(y) = 0 for y € B,(xo). Suppose 
not, and suppose that v(x) > 0 for some x € B,(ao). Let a = sup{v(y) : 
y € B,(xo)}, and let F = {y € B,(z0) : f(y) = a}. Since B,(x9) is compact, 
F is a non-empty closed set in B,(xo), and FM S,(a#) is empty. Similarly, 
there exists yo € F such that ||yo — xo|| = s = sup{||y — xol| : y € F}, and 
0<s<,r. By hypothesis, there exists 0 < t < r—s such that (x) holds. But 


then 
il 


Ta-1(St(Yo)) i UE aN 


which is not possible, since u(z) < a for x € S;(yo), and there are points «x 
in S:(yo) for which v(x) <a. 


a=v(yo) = 


Note that the condition on f does not involve derivatives, but ensures 
that f is twice continuously differentiable. 


Corollary 19.8.8 A harmonic function f on U is infinitely differentiable. 


Proof Take an orthonormal basis (e1,...,eq) for FE. An inductive argu- 
ment shows that it is sufficient to show that Of/Ox,; is harmonic, for 
1<j<d. Suppose that xp € U and that B,(x9) C U. Then 


a 
od—1(S;(x0)) 


by Theorem 19.8.2. Differentiating under the integral sign, 


fc = i, ay ea 


Of. 1 of 
50) = ETD [ eg BH): 


and so Of /Ox; is harmonic. 
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Corollary 19.8.9 Suppose that (fn)? is a sequence of harmonic func- 


n= 
tions on U which converges locally uniformly to a function f. Then f is 
harmonic. 


Proof For if B,(xo) C U then, using Theorem 19.8.2, 


f(xo) = lim fa(xo) 
— 
n—r00 Og_1(S;(Xo)) 
1 . 
= SATETTEO do th Fol) doen 
1 


= [ og ft) 


and so the result follows from Theorem 19.8.7. 


II 
5 


i) pda aa 
S,(xo) 


We shall consider harmonic functions further in Volume III. 


Exercises 


19.8.1 What are the harmonic functions on an open interval of the real line? 
19.8.2 Suppose that f and f? are harmonic on an open connected subset of 
R“. Show that f is constant. 
19.8.3 Suppose that f is a harmonic function on an open subset U of a 
d-dimensional Euclidean space EL. Show that if B,(a#) C U then 
1 


f@) = Ey [ fava.) 


Show conversely that if f is continuous on U, and that if (*) holds 
for all B,(x) contained in U, then f is harmonic. 


19.9 Curl 


We now study the operator ‘curl’ in more detail. This requires knowl- 
edge of the material in Appendix C. Throughout this section, we suppose 
that F = (f1, fo, f3) is a vector field defined on an open subset V 
of R®. 


Proposition 19.9.1 Suppose that v is a unit vector in R°. Then 


(Ye Fay = VAF *4): 
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Proof If 7(1) = 2, 7(2) = 3 and 7(3) = 1, then e, = 1. Hence 


Of: Of 
(V x F,v) €o ) “") _ €o ‘ s Uo (1) 
oes * Oo (2) ced : 


Of 
= » Eo Cee Ae aee 


This clearly corresponds to properties of the scalar triple product. 
We now apply the divergence theorem. 


Theorem 19.9.2 Suppose that U is an open subset of R? with B = 
U CV, and with boundary M consisting of a finite disjoint union of 
2-manifolds. Then 

[00x Pw) desu) = — fF) x nz) done) 

B / 


Proof Let v be a unit vector in R®. Using the preceding proposition and 
the divergence theorem, 


( (7 x Piy)avaty).») = [ ((V x F)(y),v) dvs(y) 
- [| view ) x v) du3(y) 
ay (F(x) x v,nf) doo(z) 


--| (F(x) x nf,v) doo(z) 
M 


=-(f (Fe) xnt )do2(x), v). 


This holds for all unit vectors v, and so the result follows. 


Suppose that 29 € V. We apply the theorem to balls B,(x9) C V. Suppose 
that B,(xo) C V and that G is a continuous vector field on V. We set 


1 3 
¥5(B- (0) oon Cy) des) = Fos i Gly) dus(y), 


1 1 
ar(G)(xo) = me as G(x) do2(x) = es G(x) do2(x). 


Ar(G)(ao0) = 


19.9 Curl 589 


A,(G) is the average value of G on the ball B,(xo) and a,(G) is the average 
value of G on the sphere S;(x0). 


Corollary 19.9.3 —3a,(F x nt)(x0)/r + (V x F)(xo) as r > 0. 


Proof For A,(V x F)(xo) = —3a,(F x nt)(xo)/r and A,(V x F)(x9) > 
(V x F')(ao) as r > 0. 


This expresses V x F' as a limit of volume integrals. 

It is more informative to express the components of V x F' as a limit 
of planar line integrals. In order to simplify notation, let us suppose that 
xo = 0, and let us consider the component of V x F in the eg direction. By 
Proposition 19.9.1, 


(V x F, €3) = VF x €3) = V.(foer = fie2) = Se 


Ify = yrer+yoe2 € UNex, let G(y) = fo(y)e1—fi(y)e2. The two-dimensional 
divergence of G in the plane satisfies V.G = (V x F,e3). Suppose that 
N,,(0) CU. If 0 <r < 19, let y, : [0,27] > FE be the circular path y,(s) = 
r(cos se, +sin s e2) in UNe#, and let B, be the disc {w : w € ex, ||w|| < r}. If 
ar(s) € [Yr], then nt 


Yr (8) 
divergence theorem, it follows that 


= cos(s) e1+sin(s) eg. Applying the two-dimensional 


| he Kae ie) = V.G(y) dvo(y) 
B,. Be 
Pra 


=r / (falre(s)) — fa(re(e))) ds. 


Now 4j.(s) = r(—sin se, +cos seg), and the unit tangent t(y,(s)) at y,-(s) is 
—sin se, + cos seo, so that 


QT 
| iT eG). | TCRONC HONE. 
B 0 


r 


Theorem 19.9.4 


((V x F)(0),e3) = lim —y J (F(rr(s)), (8) ds 
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Proof For 


Ap=—q | (Vx F\y),€s) daly) > (V x F\O), 3) 


ar? Jp 


r 


as r — 0. 


Exercise 


19.9.1 Suppose that F and G are vector fields on an open set U in R® and 
that f is a continuously differentiable function on U. 
(i) Show that V x (fG) = Vf x G+ f(V.G). 
(ii) Let P 7 5 
G.V= N55, + Ra a oe 
Show that V x (Fx G) = (V.G)F — (V.F)G+(G.V)F — (F.V)G. 


Appendix B 


Linear algebra 


B.1 Finite-dimensional vector spaces 


We are concerned with real vector spaces, but the results extend readily to 
complex vector spaces, as well. We describe briefly the ideas and results that 
we need!. 

Let K denote either the field R of real numbers or the field C of complex 
numbers. A vector space E over K is an abelian additive group (£,+), 
together with a mapping (scalar multiplication) (A,x) > Ax of K x E into 
E which satisfies 


u+y) =Ax+ dry, 


for A,u € K and x,y € E. The elements of F are called vectors and the 
elements of K are called scalars. 

It then follows that 0.2 = 0 and \.0 = 0 for  € E and \ € K. (Note 
that the same symbol 0 is used for the additive identity element in FE and 
the zero element in K.) 

A non-empty subset F’ of a vector space E is a linear subspace if it is 
a subgroup of & and if Ax € F' whenever A € K and zx € F. A linear 
subspace is then a vector space, with the operations inherited from FE. If A 
is a subset of E then the intersection of all the linear subspaces containing 
A is a linear subspace, the subspace span (A) spanned by A. If A is empty, 


1 For a fuller account, see, for example, Alan F. Beardon, Algebra and Geometry, Cambridge 
University Press, 2005. 
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then E.4 = {0}; otherwise 


Ey, = {Aya +-+- + Anan: Nn EN,A; € K,a; € A for 1 <i<n}. 


A subset B of EF is linearly independent if whenever 61,...,b, are distinct 
elements of B and \j,...,A% are scalars for which 


Aib1 +°+- +AnbdE = 0 


then \y =--- =A, =0. A subset B of E which is linearly independent and 
which spans F is called a basis for E. 

A vector space E£ is finite-dimensional if it is spanned by a finite set, 
Every finite-dimensional vector space FE has a basis. 


Proposition B.1.1 Jf A is a linearly independent subset of E contained 
in a finite subset C of E which spans E then there is a basis B for E with 
ACBCC. 


Proof Consider a maximal linearly independent subset of C which 


contains A, or a minimal spanning subset of C' which contains A. 


Corollary B.1.2 Every finite-dimensional space E has a basis. 


Proof Take A=Q, C a finite spanning set. 


When we list a basis as (b1,...6a), we shall always suppose that the 
elements are distinct. 


Proposition B.1.3 If B = (b,...,bq) is a basis for E, then any element 
x of E can be written uniquely as x = 21b; +---+2qbq (where 11,...,2q 
are scalars). 


Proof Since B spans FE, x can be written as x = x 1b; +--+ + xqbg. If 
g = eb) +-+++2%bg then (2, — v)bi +--+ + (tq — x/,)bg = 0, so that 
x4 — x, = 0 for 1 <i < d, by linear independence. 


Proposition B.1.4 Jf B= (bi,...,b,) andC = (c1,....q) are finite bases 
for E then k =1. 

Proof For 1<i<k we can write b; = ae jicj, and for 1 <j <1 we 
can write cj = soe Bmjbm. Then 


l 


i hk 
bj = Sa, Bui) = >, Bm Vii | Om- 
j=l m=1 m 1 


=] j= 
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Since (b1,...,b,) is a basis, the expression for b; is unique, and so 
1 = 2S BizVji- Consequently, k = ee Biz7ji). Similarly, | = 


pa Ope 1j:Pij), and sok = 1. 


Corollary B.1.5 Jf B= {b,,...,ba} is a basis for E, and A is a linearly 
independent subset of E then A is a finite set, and |A| < |B]. 


Proof Suppose that F' is a finite subset of A. By Proposition B.1.1, there 
exists a finite basis G of E with F CG C F UB. Then |F| < |G| = |B. 
Since this holds for all finite subsets of A, A is finite, and |A| < |B]. 


Thus any two bases have the same number of elements; this number is 
the dimension dim FE of E. If dim EF = d, we say that EF is d-dimensional. 


Corollary B.1.6 [fC is a spanning subset of a k-dimensional vector space 
E and |C| =k then C is a basis for E. 


Proof For C contains a subset B which is a basis, and |B| = k = |C|, so 
that C = B. 


As an example, let E = K“%, the product of d copies of K, with addition 
defined coordinatewise, and with scalar multiplication 


Miss cay tg) = (AD e245 Ne 
Let e; = (0,...,0,1,0,...,0), with 1 in the jth position. Then K“ is a vector 


space, and (e1,...,eq) is a basis for K“%, the standard basis. As another 
example, let Ma, = Ma,,(4) denote the set of all K-valued functions on 
{1,...,d} x {1,...,k}. Ma, becomes a vector space over K when addition 


and scalar multiplication are defined coordinatewise. The elements of Mz, 
are called matrices. We denote the matrix taking the value 1 at (7,7) and 
0 elsewhere by £j;;. Then the set of matrices {ij :1<i<d,1 <j < k} 
forms a basis for Mg ,, so that Ma, has dimension dk. A matrix t in Ma, 


is denoted by an array 
ty, «--.) te 


ta =... tak 
Ifl<i<dand1<j<hk, let 
| * | 


fe = Caigny | eel | : | 
taj 


r; is the ith row of t and c; is the jth column of t. 
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We denote Maa by Mg: Mg is the vector space of square matrices. We 
define the identity matrix I to be J = Bae SOLED 


B.2 Linear mappings and matrices 


A mapping T : E + F, where F and F are vector spaces over the same field 
K, is linear if 


T(e+y)=T(x)+T(y) and T(Az) =AT(z) for all A € K,2x,y € E. 


The image T(E) is a linear subspace of F' and the null-space N(T) = {x € 
E : T(x) = 0} is a linear subspace of FE. If E is finite-dimensional, then the 
dimension of TF) is the rank of T and the dimension of N(T) is the nullity 
mT yor T. 


Theorem B.2.1 (The rank-nullity formula) JfT : E > F is a linear 
mapping and if E is finite-dimensional then 


rank(T) + n(T) = dim E. 


Proof Let B be a basis for E. Then T(B) spans T(E), and so T(£) is 
finite-dimensional. Let (y1,...,y,) be a basis for T(E) and let (x1,...,2n) 
be a basis for N(T’). For each 1 < j < r there exists z; € E such that 
T(2;) = yj. We show that (21,...,27;%1,-+.,2n) is a basis for &, so that 
rank(T) + n(T) = dim E. 

Suppose that x € F and that T(x) = Ayyy +--+ + Ayr. Let v = A1z1 + 
-->+A,zp. Then T(v) = T(x), so that 


u=x-—v€ N(T) =span(z1,...,2n). 


Thus @ = w+ © spat (2hy.ss pS Mig. 252), Omid (iy... 2 5 Bey Big ors Ba) 
spans EL. If 


B= (Apz1 +++ + Apep) + (iti + +++ + Unt) = 0 


then T(x) = Aqyyi +--+ Ary, = 0, so that A; = 0 for 1 <i <7, 
and x = [12%] +-++ + fn%n = O. Hence pw; = 0 for 1 < 7 < n. Thus 


(Z1,--+)2p,X1,-+-,2n) is linearly independent. 


A bijective linear mapping J: E — F is called an isomorphism. A linear 
mapping J : E — F is an isomorphism if and only if J(£) = F and 
N(J) = {0}. If J is an isomorphism, then dim F = dim F’.. For example, if 
(fi,.--, fa) is a basis for F then the linear mapping J : K4 — F defined by 
J(M1,.--,Aa) = Afi +--+ + Agfa is an isomorphism of K% onto F. 


B.2 Linear mappings and matrices 595 


The set L(E, F’) of linear mappings from F to F is a vector space, when 
we define 


(S+T)(x) = S(a)+T(x) and (AS)(x) = A(S(2)) 


for S,T € L(E,F),c Ee E,X€ K. fT € L(E,F) and S € L(F,G) then the 
composition ST = SoT is in L(E,G). We write L(E) for L(E, E); elements 
of L(E) are called endomorphisms of E. 

An element T of L(E,F) is invertible if there exists an element T~! € 
L(F, E), the inverse of T, such that T~! o T = Ip, the identity on E, and 
ToT! = Ip, the identity mapping on F. T is invertible if and only if it is 
a linear isomorphism of EF’ onto F. 

The set of invertible elements of L(£) is a group under composition, 
with identity element Iz. It is called the general linear group GL(E). When 
E = K4%, it is denoted by GLq(K). It follows from the rank-nullity formula 
that if T € L(E£), then T is invertible if and only if it has a left inverse, and 
if and only if it has a right inverse. 

Suppose that EF and F are finite-dimensional vector spaces over K, and 
that (e1,...,e%) is a basis for E, (f1,..., fa) a basis for F and that T € 
Le, P). Let Te) = Stes fh 


k 


Ifx= Se then T(x =) St fi- (*) 


j=1 =1 \g=l 
Proposition B.2.2 The mapping T — (tj;) is then an isomorphism of 
L(E,F) onto Max, so that dim L(E, F) = dk = dim E.dim F. 


Proof The mapping T — (t;;) is clearly linear and injective. On the other 
hand, if (tj) € Ma, then the formula (*) defines an element T € L(E, F) 
whose image is (t;;), and so the mapping is also surjective. 


We say that T is represented by the matrix (t;;). If (g1,..., gr) is a basis for 
G, and S € L(F,G) is represented by the matrix (s,;) then the product R = 
ST € L(E,G) is represented by the matrix (r;;), where rp; = | Spilaz. 
This expression defines matrix multiplication. 

A matrix t in Mj is invertible if the element T € L(K®@) which it defines 
is invertible. This is so if and only if there is a matrix t~! in Mg such that 
tt-' = t-'t = I. The matrix t~! is then unique: it is the inverse of t. 

As an example, suppose that (e1,...,eq¢) and (fi,..., fa) are bases of 
E. Then the identity mapping I : (E, (he veep da) > UE Nets<a<5 8g) 48 
represented by a matrix b. J is invertible, and so therefore is b. Then b7! 
represents the mapping I : (F,(e1,...,ea)) > (E,(fi,.-., fa)). Suppose 
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now that T € L(£) and that T is represented by the matrix t with respect 
to the basis (e1,...,¢@q) . Then, considering the composite mapping 


(E, (fi,--+,fa)) 7 (E, (e1,---,€4)) 5 (E, (e1,---,€a)) (BE, (fs. fa), 


we see that J is represented by the matrix b~'tb with respect to the basis 
(Jigteongd): 

If T € L(K®%) and T is represented by the matrix t = (t;;), then t can be 
written as a finite product of matrices of a particularly simple form. 

A matrix of the form I + AE;;, where 4 is a scalar and 7 ¥ j, is called an 
elementary shear matrix. Such a matrix is invertible, with inverse I — Ej;. 
The corresponding element of L(K“) is called an elementary shear operator. 
The matrix product (J + Fj; )t is the matrix obtained by adding \ times 
the jth row of t to the ith row, and leaving the other rows unchanged. 
This multiplication is call a row operation. Similarly, the matrix product 
tI + AEj;) is the matrix obtained by adding \ times the ith column of t to 
the jth column, and leaving the other rows unchanged. This multiplication 
is call a column operation. 

A matrix of the form A, Fy, +--- + AqgEaa, where A1,....Aq are scalars, 
is called a diagonal matriz and is denoted by diag(Aj,--- Aq). If it is invert- 
ible, the corresponding element of L(K®) is called a scaling operator. The 
matrix product diag(\;,---Aq)t is obtained by multiplying the ith row of t 
by Aj, for 1 < i < d, and the matrix product tdiag(A1,--- Aq) is obtained 
by multiplying the jth column of ¢ by Aj, for 1 < j < d. The matrix 
diag(Aj,--- Aq) is invertible if and only if each A; is non-zero, and the inverse 
is then diag(\;',--- Aj"). 


Theorem B.2.3 [ft © My thent = pAq, where p and q are finite products 
of elementary shear matrices, and X is a diagonal matriz. 


Proof We show that there exist finite products p and g for which ptg = d, 
a diagonal matrix. Then p and q are invertible, their inverses p and q are 
finite products of elementary shear operations, and t = pdq. 

If t = O there is nothing to prove. Otherwise, by using a row operation 
and a column operation if necessary, we obtain a matrix t’ for which t{, 4 0. 
By using row operations and column operations, we obtain a matrix t” for 
which tj = 0 and t, = 0 for 2 < i,j < d. Now repeat the procedure, to 
obtain a matrix ¢t’/” for which ti =( ford = 1,2. tory = 1,2 snd a= 5, and 


then iterate. 


B.3 Determinants 597 


B.3 Determinants 


If o € Na, the group of permutations of the set {1,...,d}, we define the 


signature €, to be 
_ a(j) — o(#) 
w= TT (=a). 


1<i<j<d 


Then ¢€, = +1. If o,r € Ng then 


1<i<j<d j-t 


Thus the mapping 0 — €, is a homomorphism of the group ‘gq into the 
multiplicative group Dz = {1,—1}. The kernel 


Aq = {o € Dg: &o = 1} 


is the alternating group. If o is a transposition, then «, = —1. Any permuta- 
tion can be written (in many ways) as a product of transpositions; it follows 
that the number of transpositions is always even if €, = 1 and is always odd 
ife, = —l. 

Suppose now that t € Maq. The determinant det t is defined as 


dett = S° eotio(1) --- td,o(d)- 
oeXa 


Note that, since €, = €,-1, 


det t = ~, €otg-1(1),1-+-bte-1(d),d = > €oto(1),1 +++ bo(d),d: 
oeda oeda 


Here are some basic properties of the determinant function. 


Theorem B.3.1 Suppose that t,u © Maa, that \ = diag(A1,...,Aqa) is a 
diagonal matrix and that s =I + pj; is an elementary shear matric. 

(i) det A = Ay... Aq and det tA = det t. det X. 

(ii) If t has two equal columns, then dett = 0. 

(iit) det s = 1 and det ts = det t. 

(iv) det tu = det t. det u. 
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Proof (i) It follows from the definition that det A= A1... Xa. 


det tA = Se Egtt o(1)Xo(1) . ++ bdo(d) Ao(d) 
cea 


= ,...rg S- €oty,o(1) -+-td,o(d) = det t. det X. 
oeXa 


(ii) Suppose that the Ath and the /th columns are equal. Let 7 be the 
transposition (k,!) ; then tj; = t,,(;), for 1 < i,j < d. Then 


det t = S- Eat o(1) +++ bd,o(d) + > €rottr(o(1)) +++ fa,r(o(d)) 


oe Ag o€Ag 
= S° €ott o(1) eee td,o(d) + €, Ss" €ott o(1) a8 td,o(d) = 0. 
oe Ag acAa 


(iii) It follows from the definition that det s = 1. 
det ts = a €oto(1),1 bias (loiia + Lte(3),) were td,o(d) 
o€ha 


= dett+ pu > €ota(1),1 +++ boli) i++ tog) i++ to(d)d = det t, 
o€Xa 


since the second sum is the determinant of a matrix with two equal columns. 
(iv) By Theorem B.2.3, we can write u = pAq where p and q are products 
of elementary shear matrices, and A is a diagonal matrix. Then 


det u = det pAq = det pA = det p. det \ = det A 


and 
det tu = det tpAg = det tpA = det tp. det \ = det t. det u. 


The determinant determines whether or not a matrix is invertible. 


Corollary B.3.2 A matrix u in Maga is invertible if and only if its 
determinant is non-zero. 


Proof By Theorem B.2.3, u = pAq, where p and q are products of ele- 
mentary shear matrices and \ = diag(\1,..., Aq) is a diagonal matrix. Thus 
det u = det p. det A\det g = det A = A,... Ag. Since elementary shear matri- 
ces are invertible, wu is invertible if and only if d is. But d is invertible if and 
only if Ay...Aqg # 0. 
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If u is invertible then u.u~! = J, so that 1 = detI = detu.u-! = 
det u. det u~!, and det u~! = (det u)7!. 

Suppose now that T is an endomorphism of E which is represented by a 
matrix ¢t with respect to a basis (e1,...,¢@q) and by a matrix s with respect to 
a basis (f1,..., fg). Then there is an invertible matrix b such that s = b7'tb, 
and so det s = det b~!. det t. det b = det t. This means that we can define the 
determinant det T’ of the endomorphism T' to be det t: the definition does 
not depend on the choice of basis. 


B.4 Cramer’s rule 


Suppose that u € Ma q is invertible. How can we calculate its inverse? 
Suppose that 1 < i,j < d. Define the matrix u() € Maa by setting 


7 1 ifk=iandl=j, 
ules) =. 0 ifk#iand!=j, 
up, Otherwise. 


The matrix uJ) is obtained by changing the terms in the jth column of u. 
Thus 


| ai = Wage, OC tWiget <** tig ] 
7 ia: ee Moga UO Moigad S* “tag 
2, —= 
ak J) — il Beets Uij—1 1 UWig+l eee Ui,d 
deri Se Wppgeg “OO yea =" Ug 
Udi o*t' Udj-1 O Udjti ++ Udd 


Let Uj; = det uJ), Note the change in the order of the coefficients. The 
matrix U is called the adjugate of the matrix wu. It follows from the definition 
of the determinant that detu = ys U;;uijj. On the other hand, if 7 £ k, 
replace the jth column of u by the kth, to give the matrix u*!, Then ul7-*! 
has two columns equal, so that 


d 
det yl’ = Dy Ujitik =). 
i=1 


Thus U.u = (det u)I, and so u-! = (1/det u)U. This formula is known as 
Cramer’s rule. 
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Let us give one explicit example. 


Example B.4.1 Suppose that u is in SO(3), the group of orthogonal 
matrices with determinant 1. Then 


U22U33 — U23U32 U13U32 — U12U33) ~U12U23 — U13U22 

—1 
u =U= U23U31 — U21U33 U11U33 — U13U31 = U13U21 — U11U23 
UW21U32 — U22U31 U12U31 — U11U32) 4122 — U12U21 


B.5 The trace 


If t € Mg, we define the trace tr(t) of t to be the sum of the diagonal terms: 
tr(t) = yy ty. If s,te My then 


d d 


d d 
tr(st) = > Sigtji | = x (>: i) = Gel Es): 
1 j=l 


i=1 \j= i=l 


Suppose now that J is an endomorphism of EF which is represented by a 
matrix t with respect to a basis (e1,...,¢@q) and by a matrix s with respect to 
a basis (f1,..., fa). Then there is an invertible matrix b such that s = b~!tb, 
and so tr(s) = tr(b~'tb) = tr(tbb~) = tr(t). This means that we can define 
the trace tr(T’) of the endomorphism T to be tr(t): the definition does not 
depend on the choice of basis. 


Appendix C 


Exterior algebras and the cross product 


C.1 Exterior algebras 


Suppose that £ is a real vector space. An element of £, a vector, can be 
considered to have magnitude and direction. In the same way, if x and y are 
two vectors in F then they somehow relate to an area in span (2, y). If we 
wish to make this more specific, we certainly require that the area should be 
zero if and only if x and y are linearly dependent. A similar remark applies 
to higher dimensions. We wish to develop these ideas algebraically. 

A finite-dimensional (associative) real algebra (A, 0) is a finite-dimensional 
real vector space equipped with a law of composition: that is, a mapping 
(multiplication) (a,b) + aob from A x A into A which satisfies 


e (a0b)0c=a0 (boc) (associativity), 
eao(b+c)=ao0b+aoe, 
e(a+b)oc=aoc+boe, 

e A\(a0b) = (Aa) ob = ao (AD), 


for \ € R and a,b,c € A. (As usual, multiplication is carried out before 
addition). 

An algebra A is unital if there exists 1 € A, the identity element, such 
that loa = aol =a for alla € A. For example, Ma(R) and Ma(C) are 
unital algebras. Both R and C can be considered as finite-dimensional real 
algebras: R. has real dimension 1 and C has real dimension 2. 

Suppose now that F is a finite-dimensional real vector space. An exterior 
algebra for E is a unital algebra (/\*(£), A), together with an injective linear 
mapping j : E > /A\*(F) with the following properties. 


(t) j(a1) A+++ A f(a) = 0 if and only if 7,...,2% are linearly dependent 
elements in FE. 
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(t) j(E) generates A\*(E): any element of \\*“(F) can be written as a linear 
combination of the identity and products of the form j(#1) A--+ A j(xx). 


We identify E and j(£), and so we write x for j(2). 


Proposition C.1.1 If (/A*(£),A)is an exterior algebra for E and x,y € E 
then xAy = —yA«x. More generally, if x1,...%, € E anda is a permutation 
Of {1,.222,k}, then 


Lo(1) N***A Xo(k) = €oL1 A+++ AX. 


Proof For 


O=(a@+y)A(aty)=aeArtaAytyArtyAy=xrAytyAL. 


The second statement follows from this, since o can be written as a product 


of transpositions which transpose two adjacent elements of {1,...,k}. 


This shows first that any area that x /A y might represent is a signed area, 
so that the value may be positive or negative, and secondly that the order 
of the terms in a product is all-important. 

We must show that an exterior algebra exists, and that it is essen- 
tially unique. It is possible to define the exterior algebra in a coordinate 
free way, but it is probably simpler to use a basis (e€1,...eq) of E. We 
set Q = {1,...,d}. We consider a 2¢-dimensional space /A\*(E) with basis 
{e4 : A C Q} indexed by the subsets of E; thus an element x of /A\*(£) 
can be written uniquely as « = S\ycga@aea. We define the mapping j 
by setting TS ie.) = ys x,e4;3, and define multiplication in in the 
following way. 


e eg is the identity element of A\*(F); 
elf ANBFOD, then e4 Aeg =0; 
e If A and B are disjoint, and if AU B = C, then we can write 


A= {i <...< i,j} 
B={ji<...< dpi} 
C= {ki <...< ky}. 


Here the order of the terms is all-important. If A,B C , we denote the 
sequence (71,---,%4),J1,--+>J|B|), by A#B. Let o be the permutation of 
C which arranges the sequence A#B in increasing order. We define e,Aep 
to be €gec. 
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e We extend multiplication by linearity. If ¢ = S> acazaea and y = 


r\y= S “{xaypea A ep A,B CO}. 


First we show that this multiplication is associative. Suppose that A, B, 
C are subsets of 9. If A, B and C are not pairwise disjoint, it is easy to see, 
be considering various cases, that (e4 \ep) \ec = 0 and eg A\(epA€ec) = 0. 
Suppose that A, B and C are pairwise disjoint, and that D = AUBUC. Then 
(eaA\ep)Aec = €c€p, where o is the permutation obtained by first arranging 
A#B in increasing order, and then arranging (AU B)#C in increasing order. 
Similarly e4 A (ep A ec) = €,ep, where 7 is the permutation obtained by 
first arranging B#C in increasing order, and then arranging A#(BUC) in 
increasing order. Clearly o = T, so that (eg \eg) \ec = eg \(epA€c). The 
associativity of multiplication follows from this: /\“(£) is a unital algebra. 

Next we show that condition ({) is satisfied. First suppose that 71,..., 2% 
are not linearly independent. Then there exist A1,...,A,, not all zero, such 
that s - Aj; = 0. Without loss of generality, A; 4 0. Then 


k 
O=2,A--:-ALRK_1 A> gy) = Ap (1 Nove Ate). 
j=l 
so that 71 A---A xp = 0. 
Secondly, suppose that 71,...,2, are linearly independent. By Proposi- 
tion B.1.1, there exist 7,41,...,2q, so that (xz ,..., 2g) is a basis for E. Let 


Li = an x,;e;. Expanding the terms in the product, we see that 


%1A+::-ALq= (x €oXo(1),1 oe soo) eg = (det X jeg, 


o€ha 


where X is the matrix (a,j). But X is the matrix of the endomorphism 
T € L(E) which maps e; to xj, for 1 < j < d. This is invertible, and so 
det X #0. Thus 41 A---Aavqg #0, and soz, A---Aap 40 

Finally it follows from the construction that the condition (1) is satisfied. 

It is easy to see that the exterior algebra is essentially unique. Suppose 
that (F,A’) is an exterior algebra for E, with mapping j’ : E — F. If 
A= {iy < ... < taj} © Q, let wea) = j/(eq) A’ A’ 7’(en,,). Then 
it follows from ({) that 7(e4) 4 0, and from the construction of A\*(E) 
that (e4) \’ r(ep) = (ea A eg). Thus 7 extends to an injective algebra 
homomorphism of /\*(£) into F. Finally, condition ({) ensures that 7 is 
surjective. 
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As a result, we refer to ‘the exterior algebra’ (\*(£) of FE, rather than ‘an 
exterior algebra’. 
The exterior algebra /\*(£) is a graded algebra; we can write 


A*(2)=R1GEO(EAL)®- ef\*(Ee--e A), 


where A*(E) = span{x, A-:- A am : 21,...0% € E}. That is to say, if 
y € \*(£) then y can be written uniquely as y = yo.1 + ae Yk, Where 
yz € A*(E) for l<k<d. 

If (e1,..., ea) is a basis for EF as above, then expanding the products, it 
follows that A\*(E) = span{e, : |A] = k}, so that A*(E) has dimension 
Gh In particular, A{(E) is one-dimensional, and in this setting, is the span 
of eg; eg is called the unit volume element. As we have seen, if we consider 
a different basis (fi,..., fa), then fo = (detT)eg, where T is the linear 
mapping which sends e; to f;, for 1 < j < d. In particular, if EF is a Euclidean 
space and (e1,...,eq) and (fi,..., fg) are orthonormal bases, then fo = eq 
if T € SO(E), and fe = —eg otherwise. In the former case, the bases have 
the same chirality, or handedness, and in the latter, opposite chirality, or 
handedness. 


Exercise 


C.1.1 The elements of E A EF are called bivectors, and elements of the form 
xAy are called simple bivectors. Suppose that (e€1,...e4) is a basis for 
a four-dimensional space E. Calculate 


(e1 A €2 +€3 A e4) A (€1 A €2 + €3 A €4), 


and conclude that e, A eg + e3 A eg is a bivector, but not a simple 
bivector. 


C.2 The cross product 


We now restrict attention to the case where F& is a three-dimensional 
Euclidean space, with orthonormal basis (e1,¢2,e3). Then /A*(£) is an 
eight-dimensional unital algebra, and 


A *(B) =R1@ BS (ENE) OReg. 


Let fi = e2 Ae3, fo =e3 Ne, and fz = e; A eg. Then (f1, fo, fz) is a basis 
for EA E. If u= ue, + ugeg + uze3 and v = vye, + v2e2 + v3e3 then 


uAv = (ugvg — ugv2) fi + (ugv1 — uv3) fo + (uive — u2v1) fs. 
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Proposition C.2.1 Every element of EA E is a simple bivector. 


Proof Suppose that y = y:fi + yofo + ysfs € EA E. If y, = 0 then 
y = e1 A (yze2 — yres), and if y, # 0 then 


y = (yer — yie2) A (yze1 — yre3)/Y1- 


Suppose that z,y,z€ E. Then zAyAz=v(2,y, z)eq, where v(2, y, z) is 
the signed volume of the parallelepiped defined by x, y and z. Let ¢,,.(a) = 
u(z,y,z). Then dy,, is a linear functional on FE. By the Fréchet-Riesz rep- 
resentation theorem (Theorem 14.3.7), there exists a unique element of EF, 
which we denote by y x z, such that 


ult, y,2) — GygiZ) = (x,y x z) : 


The vector y x z is called the cross product of y and z. Since (y,y x z) = 
yNyAz=O0 and (z,yx z) = zAyAz =), y X z is a vector which is 
orthogonal to span (y, z). 

The quantity (7,y x z) is called the scalar triple product of x, y and z. 
Since tA yAZ= (ay ® 2) eq; 


(x,y x 2) = (y,2 x 2) = (z,0 x y) = det T, 


where T is the endomorphism of E which maps e; to x, e2 to y and eg to z. 

The mapping (y, z) > y x z is a bilinear mapping of E x E onto E, which 
satisfies y x z= —z x y. Similarly, the mapping y A z > y x z is a bijective 
linear mapping of EA E onto E. 

It is important to note that the cross product is not an associative product. 
For example, e; x (e1 X e2) = e1 X €3 = —€2, while (e) x e1) Xeg = OX eg = 0. 

Let us consider the cross product in more detail. Suppose that x,y,z € E. 
If y and z are linearly dependent, then y x z = 0. Otherwise, let g; = y/ ||y||, 
let w = z — (z, 91) 1 and let g2 = w/ ||w||. Then y x z = y x w, and gi and 
gz are orthogonal unit vectors. Let g3 be the unit vector orthogonal to 91 
and ge for which g, A g2 A g3 = eg. Then 


(91,91 X g2)=0, (92,91 X ge) =O and (93,91 x ge) =1, 
so that gi < g2 = g3. Consequently, 
yx z=yx we (llyll- wll) x 92 = (lyll - lwll)gs. 
Now 


2 2 2 2 2 2 2 
Jel" = ll2llP — 2 (2, 91)" + (2, 94)" = [zl — (2, 9)" / ly, 
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and so 
2 2 2 2 
lly x 21° = lly elle — (2,9) 


What can we say about the vector triple product x x (y x z)? 


Proposition C.2.2 Suppose that x,y,z © E. 
(i) x (y x z) = (@,z) y — (@,y) 2. 
(ti)ax(yxz)t+yx(zxa)+2zx (ex y) =0. 


Proof (i) If y and z are linearly dependent, both sides of the equation are 
zero. Otherwise, let us use the notation above. Since 


x= (, 91) 91 + (x, g2) g2 + (x, 93) 93, 


x x (y x z) = (llyll- llewll) (az, 91) (91 x 93) + (x, ge) (g2 X gs)) 
= (Ilyl] - lwll)(— (2, a1) 92 + (x, 92) 91) 
= —(z,y) wt (x,w) y. 
But 


— (t,y) w= — (@,y) 2 + (a, y) (2,91) 91 
and (x,w) y = (x, 2) y — (2,91) (@, 91) ¥ = (@, 2) y — (2,91) (@, ¥) 1- 


Adding, we obtain the result. 
(ii) follows by adding the formulae for each of the three terms. 


Appendix D 
Tychonoff’s theorem 


We prove Tychonoff’s theorem, that the topological product of compact 
topological spaces is compact. The key idea is that of a filter. This generalizes 
the notion of a sequence in a way which allows the axiom of choice to be 
applied easily. 

A collection ¥ of subsets of a set S' is a filter if 


FlifFPe¢FandGDF then Ge f, 
F2ifFeFandGef then FNGEF, 
F3 0 ¢ F. 


Here are three examples. 


e If A is a non-empty subset of S then {F': A C F} is a filter. 

e Suppose that (X,7) is a topological space, and that « € X. The collection 
N, of neighbourhoods of z is a filter. 

e If (s,) is a sequence in S then 


{F': there exists N such that s, € F forn > N} 


is a filter. 


Filters can be ordered. We say that G refines F, and write G > fF, if 
GOOF. 

We now consider a topological space (X,7T). We say that a filter F con- 
verges to a limit x (and write F — x) if F refines N,. Clearly if G refines F 
and F > x then G > x. 

The Hausdorff property can be characterized in terms of convergent filters. 


Proposition D.0.6 (X,r) is Hausdorff if and only if whenever F > x 
and F > y thenuz=y. 
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Proof If (X,7) is Hausdorff, if F > x, and if « # y then there exist 
disjoint open sets U and V with x € U and y € V. Since U € Nz, U € F. 
But then V ¢ F, since UNV = @. Thus F does not refine N,, and so F & y. 

Conversely if (X,7) is not Hausdorff there exist distinct 7 and y such that 
if U € N, and V € N, then UNV is not empty. Let 


F={F:F UNV for some U € Nz and V € Ny}. 


Then F is a filter which converges to both x and y. 


We say that x is adherent to a filter F if « € F for each F € F. 
Proposition D.0.7 If F > «x then x is adherent to F. 


Proof For if F € F and N € N; then N € F and so FON € F. Thus 
FN is not empty, and so x € F. 


Proposition D.0.8 Jf x is adherent to F then there is a refinement G 
such that G > a. 


Proof — Let 


G={G:GD FON for some F € F,N € N;}. 


G is a filter which refines both F and N,. 


Suppose that S and T are sets, and that f is a mapping from S to T. If 
F is a filter on S the image filter f(#) on T is defined by 


FPS eT er). 
It is easy to check that this is a filter. 


Proposition D.0.9 Suppose that X and Y are topological spaces, that f 
is a mapping from X to Y and that x € X. Then f is continuous at x if 
and only if whenever F > x then f(F) > f(x). 


Proof If f is continuous at 2 and F — x then if N € Ny (q) then foN)e 

Nz © F, so that N € f(F). Thus f(F) refines N(,) and f(F) > f(z). 
Conversely if the condition is satisfied, then since N;, > x, f(Nz) > f(z), 

so that if N € Ny) then N € f(Nz), and so f-!(N) € Nz. Thus f is 


continuous at x. 


Proposition D.0.10 A topological space (X,T) is compact if and only if 
every filter on X has an adherent point. 
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Proof Suppose that (X,7) is compact, and that F is a filter on X. The 
collection of closed sets {F : F € F} has the finite intersection property, and 
so has a non-empty intersection. Any point of the intersection is adherent 
to F. 

Conversely, suppose that the condition is satisfied, and that C is a collec- 
tion of closed sets with the finite intersection property. Let D be the set of 
finite intersections of sets in C and let 


F={F:F DD, for some Dé DP}. 
F is a filter: if x is an adherent point then 


£EeMC:Ceck=n{C:Cecth. 


Proposition D.0.11 A topological space (X,T) is compact if and only if 
every filter on X has a convergent refinement. 


Proof — Propositions D.0.7, D.0.8 and D.0.10. 


Compare this with the Bolzano—Weierstrass theorem. 

Recall that if (Xa)aea is a family of topological spaces, we give the prod- 
uct []oc4 Xa the product topology, taking as basis of open sets the sets of 
the form nam. (O..), where Og, is open in Xq,. This means that N € N; 
if and only if N D M%_,7g(No,), where No, € N7.,(v) for some ay,..., An 
in A. 


Proposition D.0.12 Suppose that F is a filter on ||,-4 Xa. ThenF > & 
if and only if Ta(F) > Ta(x) for each a € A. 


Proof If F > a, then ma(F) — aa(x), since tq is continuous. Con- 
versely suppose that ta(F) — 7ma(x), for each a. Then if N € Nz, 
N D m1, (No,), for suitable n, a; and No,. But No, € Ta,(F), since 
Ta,(F) 4 Ta,(z), and so 7;,\(No,) € F. Thus N € F, and so F > a. 


One major virtue of filters is that they allow the axiom of choice, in 
the form of Zorn’s Lemma, to be applied easily. The filters on a set S are 
ordered by refinement. If C is a chain of filters refining F then {G : G € 
G for some G € C} is a filter which is an upper bound for C. Thus by Zorn’s 
Lemma, every filter has a maximal refinement. A maximal filter is called an 
ultrafilter. 


Proposition D.0.13 Every filter has an ultrafilter refinement. 
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Proposition D.0.14 Jf is an ultrafilter on S and A CS then exactly 
one of A and C(A) is inU. Conversely if F is a filter on S with the property 
that if AC S then either A € F or C(A) € F then F is an ultrafilter. 


Proof Since AN C(A) = 0, at most one of A and C(A) can belong to UW. 
Suppose that AMU is non-empty, for each U € U. Then the sets {V : V D 
ANU for some U € U} form a filter which refines U/, and so is equal to U, by 
maximality. Thus A € U. Otherwise, there exists Up € U such that AM Uo 
is empty. Then C(A) > Up, so that C(A) €U. 

Conversely, let G be a refinement of F, and suppose that G € G. If C(G) € 
F, then C(G) € G, giving a contradiction. So G € F, and F is maximal. 


Proposition D.0.15 IJfU is an ultrafilter on S and f is a mapping from 
S to T then f(U) is an ultrafilter on T. 


Proof If BCT then f~'(C(B)) = C(f~'(B)), so that either f-'(B) CU 
or f-'(C(B)) €U. Thus either B or C(B) is in f(U). 


Proposition D.0.16 A topological space (X,T) is compact if and only if 
every ultrafilter on X converges. 


Proof If (X,7) is compact and YU is an ultrafilter on X, then U has a 
convergent refinement, by Proposition D.0.11. But any refinement of U/ is U/ 
itself. 

Conversely, if the condition is satisfied, and F is a filter on X, then 
F has an ultrafilter refinement, by Proposition D.0.13. This converges, by 


hypothesis, and so (X,7) is compact, by Proposition D.0.11. 


Theorem D.0.17 (Tychonoff’s theorem) If (Xqa,Ta)aca is a family of 
compact topological spaces, then [yc 4 Xa is compact in the product topology. 


Proof Let U be an ultrafilter on [[,-4 Xa. Then for each a, Ta (UY) is an 
ultrafilter on Xq, and so it converges, to %q, say. Let x = (aq), so that 
La = T(x). Then U > z in the product topology, by Proposition D.0.12. 


This gives the result. 


Exercises 


D.0.1 Suppose that ¥ is a filter on S, and that f is a mapping from S to 
T. When is {f(F) : F € F} a filter? 

D.0.2 Let (2,,) be a sequence in a topological space (X,7), and let F be the 
filter {A : there exists N such that x, € A for all n > N}. Show that 
tp — x if and only if F > a. 


D.0.3 


D.0.4 


D.0.5 
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A collection 6 of non-empty subsets of S is a filter base if whenever 
B,, By © B then there exists Bs € B with B3 C By, M Bo. Show that 


F={F:F)2B for some Be B} 


is a filter. 

A filter base B is free if Nn{B : B € B} = 0. Characterize compactness 
in terms of the non-existence of certain free filter bases. 

Suppose that for each x in a set X there is given a filter NV, with the 
following properties: 

(a) x € N for each N € N,; 

(b) if N € Nz there exists M € Nz with M C N such that M € N, 
for each y € M. 

Show that the collection of sets {U : U € Ny, forally € U} isa 
topology on X and that the filters Nz are the neighbourhood filters 
for this topology. 
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